
NGL-05-018-104


USCEE Report 480



UNIVERSITY OF SOUTHERN CALIFORNIA 




A STUDY OF SYNCHRONIZATION TECHNIQUES
FOR OPTICAL COMMUNICATION SYSTEMS


R. M. Gagliardi

January 1975

Final Technical Report


Prepared for

National Aeronautics and Space Administration
Office of University Affairs
Washington, D. C. 20546


ELECTRONIC SCIENCES LABORATORY 




NGL- 05-01 8-104 


USCEE Report 480 


A STUDY OF SYNCHRONIZATION TECHNIQUES 
FOR OPTICAL COMMUNICATION SYSTEMS 


R. M. Gagliardi


January 1975 


Final Technical Report 


Prepared for 

National Aeronautics and Space Administration 
Office of University Affairs 
Washington, D. C. 20546


This work was sponsored by the National Aeronautics and Space 
Administration, under NASA Contract NGL-05-018-104. This grant
was part of the research program at NASA's Goddard Space Flight 
Center, Greenbelt, Maryland. 





TABLE OF CONTENTS 


1. INTRODUCTION

2. PROGRAM OBJECTIVE

3. SUMMARY OF PROGRAM ACCOMPLISHMENTS

4. PROGRAM DOCUMENT LIST

5. APPENDIX (SELECTED CONTRACT REPORTS)





1. INTRODUCTION 


This document is a final report of work done in the Department of 
Electrical Engineering at the University of Southern California for 
the National Aeronautics and Space Administration, in the area of 
optical communications. The work effort was carried out under the 
guidance of Professor Robert M. Gagliardi of the Electrical Engineering 
Department, and covered an extended period commencing in 1969 and 
ending January 31, 1975. The work was initiated as a joint research
effort between the University of Southern California and NASA's
Electronics Research Center in Cambridge, Massachusetts. The work
was later monitored by the Electro-optics Division at the Goddard
Space Flight Center at Greenbelt, Maryland. The contract was funded
by NASA's Office of University Affairs under Grant NGL 05-018-104. 

The objective of the program was to study synchronization techniques 
and related topics in the design of high data rate, deep space, optical 
communication systems. The research was solely analytical in nature 
and was divided into two basic categories. The first involves tasks 
with direct application to the time synchronization problem, while the 
second involves related areas also being studied under the grant. The 
study was to indicate design procedures, assess system performance 
and predict future areas of needed study in synthesizing and improving 
digital optical systems. 

This final report reviews the program objectives, the significant
results, and the published research work generated during the program
tenure.



2. PROGRAM OBJECTIVE 


This study program was initiated in December 1968 at a time when 
NASA was vitally interested in developing a high data rate, deep space, 
optical communication system. The primary mode of operation was to 
be direct detection digital transmission, with interest in possible block 
encoding to achieve improved data rates. Use of narrow pulsed optical 
sources was expected to be the principal signalling format. At that
time, some questions existed concerning the ability to time synchronize 
low duty cycle optical systems for bit and word detection. For these 
reasons the study program was initiated. 

The specific work tasks of the program were: 

1) To determine the effect of timing errors in narrow pulsed digital 
optical systems. This task would allow for a determination of the 
required timing needed in system design in order to maintain necessary 
temporal coherence. At the time of program commencement accurate 
statistical models for optical detection were only partially known. Thus, 
a subtask here was the development of usable system models for analysis 
of bit and word error probabilities for both perfectly and imperfectly
timed systems. 

2) To determine the accuracy to which well known microwave timing 
systems can be operated in a low powered optical system. 

3) To derive improved tracking systems for the optical channel. 
Also, to determine the degree of improvement that can be expected by
these newer systems, and possible upper bounds to tracking performance.
This would allow comparison to current state of the art systems,
for a cost effective study of redesigning synchronization
systems.

Other areas of interest closely related to the above primary tasks 
were also to be considered: 

4) An establishment of a usable photodetector mathematical model 
for application to the analysis and design of performance in a communi- 
cation receiver. 

5) A study of the application of multi-level block encoding to the 
optical transmissions of digital data, and possible improvements in 
transmitted information rates. 





3. SUMMARY OF PROGRAM ACCOMPLISHMENTS AND RESULTS 

Since the study effort was solely analytical in nature, the results 
of the program were technical reports, summarizing the achieved 
milestones. The program produced a total of 11 technical reports,
12 published papers, and 2 Ph.D. dissertations. The key accomplishments
of the program are summarized below. References refer to
the listed reports in Section 4, where the stated results are documented.

1) Developed an accurate mathematical model of the photodetecting 
receiver and its statistical properties for use in digital receiver design. 
Specifically, a detailed study was made of the detector shot noise process 
and its interrelation with the counting processes that govern it. Investi- 
gation of the conditional Poisson counting process (referred to more 
recently as a doubly stochastic Poisson process) was made in depth, 
exploring the relation of Poisson, Laguerre, and Bose-Einstein counting.
The relation of optical shot noise to Gaussian processes was studied.
References - Section 4.1 [4,6,7]; Section 4.2 [3,4,6].

2) Studied the pulse position modulated (PPM) mode of optical 
digital transmission, showing its optimality and practical system 
implementation. The results were extended to block encoded systems 
and resulting error probabilities were derived. A computer program 
was developed for computation of PPM system performance under all 
possible operating conditions. Section 4.1 [1,2,3]; Section 4.2 [1,2,6,9].

3) Determined the effects of timing errors in both PPM and on-off 
keyed digital systems. Section 4.1 [8]; Section 4.2 [5,7].




4) Determined the ability to time and phase lock in phase and pulse 
tracking subsystems following photodetection in direct detection optical 
systems. Relations between tracking errors and operating signal to 
noise ratio were developed. Section 4.1 [5,9]; Section 4.2 [11];
Section 4.3 [1].

5) Determined the optimal tracking system for optical systems.
The approach here was to invoke the use of estimation theory and treat
the tracking problem as one of estimating arrival time of a synchronizing
signal. The optimal tracker was then determined as an optimal estimator
of arrival time. This also allowed for a study of signal waveshape
for best obtaining the timing information. Section 4.1 [10];
Section 4.2 [10].

6) Investigated digital signalling procedures other than PPM that 
aid in overcoming the time accuracy problem. Although PPM systems 
are optimal for perfectly timed systems, it was shown that
noncoherent, multi-level frequency shift keyed systems using harmonic
square waves are more efficient at high data rates. This study effort
represents a new area of research not included in the task statement.
Studies in this area are not completed. Section 4.1 [11]; Section 4.2
[12]; Section 4.3 [2].

7) Extended the problem of time tracking to spatial pointing, 
acquisition, and spatial tracking. Developed the relationship among 
power levels, pointing accuracy, performance, and acquisition times 
in locating spatially positioned transmitters. Computer programs have
been developed for this purpose. Studies in this area are still under
investigation and research has not been completed.

There were no patents or inventions produced from this research.





4. PROGRAM DOCUMENT LIST 

The following lists all research and technical reports, published 
papers, and documents generated from, and accredited to, this study
grant.







4.1 Technical Reports

[1] NASA Technical Note TN D-4623, "M-ary Poisson Detection and
Optical Communications," S. Karp and R. Gagliardi, June 1968.

[2] NASA Technical Note TN D-4814, "Design of PPM Optical
Communication Systems," S. Karp and R. Gagliardi, October 1968.

[3] NASA Technical Note TN C-40, "Error Probabilities for Detection
of M-ary Poisson Processes in Poisson Noise," S. Karp, G.
Hurwitz and R. Gagliardi, May 1968.

[4] USCEE Report 334, "On the Representation of Continuous Stochastic
Intensities by Poisson Shot Noise," R. Gagliardi and S. Karp,
March 1969.

[5] USCEE Report 396, "Optical Synchronization - Phase Locking
With Shot Noise Processes," R. Gagliardi and M. Haney, August
1970.

[6] USCEE Report 397, "Communication Theory for the Free Space
Optical Channel," R. Gagliardi, S. Karp and E. O'Neill, August
1970.

[7] USCEE Report 401, "Counting Statistics for Extended Optical
Photodetectors," R. Gagliardi and V. Farrukh, January 1971.

[8] USCEE Report 406, "The Effect of Timing Errors in Optical Digital
Systems," R. Gagliardi, August 1971.

[9] USCEE Report 426, "Synchronization Using Pulse Edge Detection
in Optical PPM Communication Systems," R. Gagliardi, September
1972.

[10] USCEE Report 448, "MAP Synchronization in Optical Communication
Systems," R. Gagliardi, N. Mohanty, April 1973.

[11] USCEE Report 471, "Noncoherent Detection of Periodic Optical
Signals," R. Gagliardi, April 1974.


4.2 Published Papers

[1] R. Gagliardi and S. Karp, "M-ary Poisson Detection and Optical
Communications," IEEE Trans. on Communication Technology,
Vol. CT-17, No. 2, April 1969, pp. 208.

[2] S. Karp and R. Gagliardi, "The Design of PPM Optical Communication
Systems," IEEE Trans. on Communication Technology,
Vol. CT-17, December 1969, pp. 670.

[3] R. Gagliardi and S. Karp, "On the Representation of a Continuous
Intensity by Poisson Shot Noise," IEEE Trans. on Info. Theory,
Vol. IT-16, No. 2, March 1970.

[4] S. Karp, R. Gagliardi and E. O'Neill, "Communication Theory
for the Free Space Optical Channel," Proc. of the IEEE, Vol. 58,
No. 10, October 1970, pp. 1611.

[5] R. Gagliardi, "On the Timing Problem in Optical Digital Systems,"
Proceedings of the International Telemetry Conference, September
1971, Washington, D. C.

[6] R. Gagliardi, "Photon Counting and Laguerre Detection," IEEE
Trans. on Info. Theory, Vol. IT-18, January 1972, pp. 208.

[7] R. Gagliardi, "The Effect of Timing Errors in Optical Digital
Systems," IEEE Trans. on Communication Technology, Vol.
CT-20, No. 2, April 1972.

[8] N. C. Mohanty, "On the Identifiability of Finite Mixtures of
Laguerre Distributions," IEEE Trans. on Information Theory,
Vol. IT-18, No. 4, July 1972.

[9] N. C. Mohanty, "M-ary Laguerre Detection," IEEE Trans.
on Aerospace and Electronic Systems, Vol. AES, May 1973.

[10] N. C. Mohanty, "Estimation of Delay of M PPM Signals in
Laguerre Communications," IEEE Trans. on Communication,
Vol. COM-22, No. 5, May 1974.

[11] R. Gagliardi, "Synchronization Using Pulse Edge Tracking in
Optical PPM Communication Systems," IEEE Trans. on Comm.,
Vol. COM-22, No. 10, October 1974.

[12] R. Gagliardi, "Noncoherent Detection of Periodic Optical Pulses,"
IEEE Trans. on Information Theory, Vol. IT-5, May 1975 (to be
published).





4.3 Ph.D. Dissertations 



[1] "Phase Locked Loop Tracking of Shot Noise Processes," by
George Michael Haney, presented to the Graduate School of
the University of Southern California, January 1971.



[2] "Non-coherent Detection of Subcarrier Frequencies in Direct
Detection Optical Communication Systems," by Richard A. Maag,
presented to the Graduate School of the University of Southern
California, February 1975 (to be published).





5. APPENDIX 


Reprints of most of the reports in Section 4 are included. When a 
report appeared as both a technical document and a published paper, 
only the paper was included. The reports included are listed in the 
following order (numbers refer to their listing in Section 4):

Section 4.2: [1], [2], [3], [4], [6], [7], [8], [9], [10]

Section 4.1: [10], [11]




April 1973 


USCEE Report 448 


Interim Technical Report 


MAP SYNCHRONIZATION IN OPTICAL 
COMMUNICATION SYSTEMS 



R. M. Gagliardi 
N. Mohanty 

Department of Electrical Engineering 
University of Southern California 
Los Angeles, California 90007 


This work was sponsored by the National Aeronautics and Space 
Administration, under NASA Contract NGR-05-018-104. This grant
was part of the research program at NASA's Goddard Space Flight 
Center, Greenbelt, Maryland. 


ABSTRACT 


The time synchronization problem in an optical communication system 
is approached as a problem of estimating the arrival time (delay variable) 
of a known transmitted field. Maximum aposteriori (MAP) estimation 
procedures are used to generate optimal estimators, with emphasis 
placed on their interpretation as a practical system device. Estimation 
variances are used to aid in the design of the transmitter signals for best 
synchronization. Extension is made to systems that perform separate 
acquisition and tracking operations during synchronization. The closely 
allied problem of maintaining timing during pulse position modulation is 
also considered. The results of this report have obvious application to
optical radar and ranging systems, as well as the time synchronization
problem.


Introduction 


An important requirement in a successful communication system is to 
maintain accurate timing between transmitter and receiver. This timing 
is generally achieved by having the transmitter continually send a known 
clock signal to which the receiver can synchronize. For the system to be 
time locked, the receiver synchronization subsystem must determine the 
exact time at which the clock signal arrives. This measurement of clock 
arrival time can be considered a measurement of transmission delay time, 
which can be used to continually adjust the receiver clock relative to that 
of the transmitter. An analytical approach to the design of synchronization 
subsystems is to consider this arrival (delay) time measurement as an 
estimation problem. In this context, optimal estimators for measuring 
delay can then be implemented as practical devices for achieving 
synchronization. 

In an optical communication system the arrival time measurement is 
hindered by both the quantum effects of the photodetection operation and by 
the reception of background noise radiation in the optical antenna. In this 
report the design of synchronizing subsystems in optical receivers is examined 
from an estimation point of view. Maximum aposteriori (MAP) estimators
of delay are derived for both quantum limited and background additive
operation, and their interpretation as practical subsystems is explored.

Problem Formulation 

Let the timing information be sent from transmitter to receiver in the
form of a known optical field $f(t, \mathbf{r})$, where $t$ and $\mathbf{r}$ are the temporal and spatial
variables. The transmitted field is detected by the receiving system shown
in Figure 1. A photodetector, having spatial area $\mathcal{A}$ normal to the beam
propagation, intercepts the optical field, producing the detector output signal
$x(t)$. The detected field has the intensity $|f(t-\tau, \mathbf{r})|^2$, where $\tau$ is the time
delay during transmission. If we assume the field was transmitted at
$t = 0$, then $\tau$ is alternatively the time of arrival of the field at the receiver.
The detector output $x(t)$ is given mathematically by the shot noise process

$$x(t) = c \sum_{m=0}^{k(0,t)} h(t - t_m) \tag{1}$$


where $h(t)$ is the detector response function, $c$ is a proportionality constant
related to electron charge and detector impedance, $\{t_m\}$ are the random
location times of the emitted photoelectrons, and $k(t_1, t_2)$ is the random
number emitted during $(t_1, t_2)$. The latter is called the detected count
process and, in the absence of background field noise, is known to have a
Poisson count probability with intensity parameter

$$m(t_1, t_2) = \int_{t_1}^{t_2} n(t-\tau)\,dt \tag{2}$$

where

$$n(t-\tau) = \alpha \int_{\mathcal{A}} |f(t-\tau, \mathbf{r})|^2\,d\mathbf{r} \tag{3}$$

and $\alpha$ is the photodetection parameter.


The function $n(t)$ is the spatially integrated field intensity and is called
the count intensity function. When bandlimited Gaussian white background
noise is present, the count over $(t_1, t_2)$ is known to have a Laguerre count
probability:

$$\mathrm{Prob}[k(t_1, t_2) = k] = [\,\cdots\,] \tag{4}$$

where $Q$ is the number of time-space modes observed over $(t_1, t_2)$ and $N_0$
is the average noise count per mode.


The detector time process $x(t)$ in (1) is then processed in the sync
subsystem, herein considered a device that produces an estimate of the
arrival time $\tau$. This estimate can then be used to clock all subsequent
receiver operations requiring transmitter synchronization (e.g., bit timing,
ranging, etc.). In typical system operation, this timing must be continually
updated and the estimation of $\tau$ must be repeated by continually retransmitting
the optical field. For this reason the optical field, and therefore the intensity
$n(t)$ in (3), is considered a periodic waveform in $t$ with repetition period $T$.
A receiver observation of $T$ sec therefore corresponds to one period of the
intensity waveform. The estimation problem is therefore one of observing
over $(0,T)$ the photodetected output due to the repeated optical field producing
the count intensity $n(t-\tau)$, and estimating the variable $\tau$. Although we shall
concentrate on the estimation problem over a single interval, the resulting
processing may then be repeated over subsequent intervals, making use of
earlier estimates. Only maximum aposteriori (MAP) estimates are
considered. The procedures of MAP estimation are discussed in References
[1-3], and the specific application to optical systems is reviewed in [4].
The pertinent equations necessary for this report are summarized in the
Appendix.
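As an illustration of the count model in (2) and (3), the photoelectron times over one period can be simulated by thinning a homogeneous Poisson process. The sketch below is not part of the original report: the Gaussian pulse shape, the function names `gaussian_intensity` and `simulate_arrivals`, and all parameter values are assumptions chosen for the example, and background noise is ignored.

```python
import math
import random

def gaussian_intensity(t, energy, width, period):
    """Periodic Gaussian count intensity n(t), pulse centred at multiples of `period` (assumed shape)."""
    # Wrap t into [-period/2, period/2) so the pulse repeats every period.
    u = (t + period / 2.0) % period - period / 2.0
    return energy / (math.sqrt(2.0 * math.pi) * width) * math.exp(-u * u / (2.0 * width * width))

def simulate_arrivals(delay, energy, width, period, rng):
    """Draw photoelectron times on (0, period) with intensity n(t - delay) by thinning."""
    peak = energy / (math.sqrt(2.0 * math.pi) * width)  # upper bound on n(t)
    times = []
    t = 0.0
    while True:
        t += rng.expovariate(peak)  # candidate from a homogeneous process of rate `peak`
        if t >= period:
            return times
        # Accept the candidate with probability n(t - delay) / peak.
        if rng.random() * peak <= gaussian_intensity(t - delay, energy, width, period):
            times.append(t)
```

The returned list plays the role of the location times $\{t_m\}$ in (1); its length is one realization of the count $k(0,T)$.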




MAP Estimation of Delay 

The MAP estimate of $\tau$ under Poisson counting follows directly from
the Appendix, with $\tau$ replacing $\theta$. Since $n(t)$ is periodic with period $T$, it
can be expanded into a Fourier series at harmonics of frequency $1/T$,
each of which integrates to zero in the third term of (A-5). Furthermore,

$$\frac{\partial n(t-\tau)}{\partial \tau} = -\left.\frac{dn(t)}{dt}\right|_{t \to t-\tau} \tag{5}$$

The MAP estimate $\hat{\tau}$ is then that $\tau$ for which

$$\max_{\tau}\left\{\int_0^T x(t)\log[n(t-\tau)]\,dt + \log p(\tau)\right\} \tag{6}$$

or that satisfying

$$\int_0^T x(t)\left[\frac{d\log n(t)}{dt}\right]_{t \to t-\tau} dt = \frac{1}{p(\tau)}\frac{dp(\tau)}{d\tau} \tag{7}$$

when the intensities are differentiable. The optimal estimator in (6)
corresponds to determining the maximum of a bank of crosscorrelations
of the detector output with all possible delay shifts of $\ln n(t)$, as shown in
Figure 2a. Alternatively, the integral can be interpreted as the output at
time $\tau$ of a passive filter whose input is $x(t)$ and whose impulse response
is $\ln n(-t)$, as shown in Figure 2b. The filter output at every $\tau$ is then
weighted by $\log p(\tau)$, and the value of $\tau$ producing the maximum is the MAP
estimate of $\tau$.
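The bank-of-correlators interpretation of (6) can be sketched numerically as a grid search over candidate delays. The sketch assumes an impulsive detector response with $c = 1$, so that the integral of $x(t)\log n(t-\tau)$ reduces to a sum over the photoelectron times; it also assumes a periodic Gaussian intensity and a Gaussian prior. The function names, the grid resolution and the parameter values are illustrative assumptions, not taken from the report.

```python
import math

def log_intensity(t, energy, width, period):
    """log n(t) for a periodic Gaussian pulse (assumed shape)."""
    u = (t + period / 2.0) % period - period / 2.0
    return math.log(energy / (math.sqrt(2.0 * math.pi) * width)) - u * u / (2.0 * width * width)

def map_delay_grid(times, energy, width, period, prior_mean, prior_var, steps=2000):
    """Grid search for the delay maximising sum_m log n(t_m - tau) + log p(tau)."""
    best_tau, best_score = None, -math.inf
    for i in range(steps):
        tau = period * i / steps
        # Correlation term of (6), evaluated as a sum over the observed times t_m.
        score = sum(log_intensity(t - tau, energy, width, period) for t in times)
        # Gaussian log prior with additive constants dropped.
        score += -((tau - prior_mean) ** 2) / (2.0 * prior_var)
        if score > best_score:
            best_tau, best_score = tau, score
    return best_tau
```

Each grid point corresponds to one correlator of Figure 2a; a finer grid trades computation for delay resolution.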

When $p(\tau)$ is Gaussian with mean $m_\tau$ and variance $\sigma_\tau^2$, then (7) is
convenient to use, and takes the form

$$\hat{\tau} = m_\tau - \sigma_\tau^2 \int_0^T x(t)\left[\frac{d\ln n(t)}{dt}\right]_{t \to t-\hat{\tau}} dt \tag{8}$$

The MAP estimate $\hat{\tau}$ appears on both sides and an explicit solution is not
immediately available. However, we can interpret the integral as a
correlation of the detector output with a delayed version of the bracketed
expression. Hence, the MAP estimate is the value of $\hat{\tau}$ which forces this
right hand side to equal $\hat{\tau}$. This suggests an estimator similar to that
shown in Figure 2c, employing a feedback loop to generate the proper $\hat{\tau}$
to force the loop to lock in (when $\hat{\tau}$ is correct the output of the correlator
is that necessary to maintain the loop). Note the loop involves crosscorrelations
with the time derivative of $\log n(t)$ and the specific form of the loop signal
generator depends upon the transmitted intensity. If $n(t)$ is a pure sinusoidal
intensity the feedback loop specializes to the tan-lock loop [4]. If $n(t)$ is
periodic, but non-sinusoidal, the form of the MAP estimator loop changes.

For example, let

$$n(t) = \frac{\mathcal{E}}{\sqrt{2\pi}\,D}\,e^{-t^2/2D^2}, \qquad -T/2 \le t \le T/2 \tag{9}$$

representing a Gaussian shaped intensity pulse of width $D$ and energy $\mathcal{E}$,
extended periodically in time, as in Figure 3a. We assume $T$ is many
times larger than $D$ so that the pulse occupies a relatively small portion
of the observation interval and end effects can be neglected. For this case,

$$\frac{d\ln n(t)}{dt} = \frac{d}{dt}\left(-\frac{t^2}{2D^2}\right) = -\frac{t}{D^2} \tag{10}$$

and (8) becomes

$$\hat{\tau} - m_\tau = \frac{\sigma_\tau^2}{D^2}\int_0^T x(t)\,(t-\hat{\tau})\,dt \tag{11}$$

Hence,

$$\hat{\tau} = \frac{\int_0^T t\,x(t)\,dt + m_\tau\left(D^2/\sigma_\tau^2\right)}{\int_0^T x(t)\,dt + \left(D^2/\sigma_\tau^2\right)} \tag{12}$$

The integral in the denominator is the observed total number of counts
$k(0,T)$. The numerator integral is the "mean", or "center of gravity",
of the observed detector process $x(t)$. The MAP estimator therefore
computes the "mean" or "center of gravity" of the shot noise locations in
time and uses it in (12). In the typical situation the initial delay uncertainty
is many times the pulse width, so that $\sigma_\tau^2 \gg D^2$, and the MAP estimate is
precisely this mean location time.
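The center-of-gravity estimator can be written in a few lines once the detector output is represented by its photoelectron times. The sketch below assumes an impulsive detector response with $c = 1$, so that the integral of $t\,x(t)$ becomes the sum of the times and the integral of $x(t)$ becomes the count; the function name and the example numbers are assumptions made for illustration.

```python
def center_of_gravity_estimate(times, width, prior_mean, prior_var):
    """Delay estimate for a Gaussian pulse of width D and a Gaussian prior on the delay."""
    weight = width * width / prior_var  # the ratio D^2 / sigma_tau^2
    # Numerator: first moment of the counts plus the weighted prior mean.
    # Denominator: total count k(0, T) plus the same weight.
    return (sum(times) + prior_mean * weight) / (len(times) + weight)

# With a very broad prior the estimate is the sample mean of the arrival times.
print(center_of_gravity_estimate([3.9, 4.0, 4.1], 0.2, 5.0, 1.0e6))
```

When no counts are observed the expression falls back to the prior mean, which is the behaviour one would expect from a MAP estimate.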

It is interesting to see how the estimator changes form as the optical
pulse becomes sharper in form. Consider the pulse in Figure 3b, with its
log derivative shown in Figure 3c. Equation (11) becomes instead

$$\hat{\tau} - m_\tau = [\,\cdots\,]\left[\int_{\hat{\tau}}^{\hat{\tau}+\epsilon} x(t)\,dt - \int_{\hat{\tau}+D+\epsilon}^{\hat{\tau}+D+2\epsilon} x(t)\,dt\right] \tag{13}$$

The feedback estimator now corresponds to the short term integration over
the front and back end of the expected optical pulse, as the pulse is swept
through the observation interval. In essence, the estimate is that value of
$\hat{\tau}$ that "locks up" equal $\epsilon$ sec integrations separated by $D$ sec, as shown in
Figure 4. Effectively the detector output is being "gated", and the tracking
loop that implements (13) is often called an early-late gate loop. Note that
as $\epsilon \to 0$ in Figure 3b the pulse rise and fall time decreases, and the
estimator integrates over a smaller portion of the observed output. Hence,
as the optical pulse used for delay estimation is changed from a smooth
Gaussian pulse to a sharper pulse waveform, the optimal estimator form
changes from a center-of-gravity estimator to the early-late gate loop.
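The gating operation of an early-late gate loop can be illustrated by counting photoelectrons in two short windows placed at the expected leading and trailing edges of the pulse. This is only a sketch of the gate difference, not of the full feedback loop; the function name, the half-open window convention and the sample values are assumptions, and the detector response is taken as impulsive.

```python
def early_late_difference(times, tau_hat, width, edge):
    """Count in the early gate minus count in the late gate for a trial delay tau_hat."""
    # Early gate: edge-length window starting at the trial delay.
    early = sum(1 for t in times if tau_hat <= t < tau_hat + edge)
    # Late gate: edge-length window starting width + edge after the trial delay.
    late_start = tau_hat + width + edge
    late = sum(1 for t in times if late_start <= t < late_start + edge)
    return early - late

# Two counts fall in the early gate and one in the late gate.
print(early_late_difference([1.05, 1.08, 2.15], 1.0, 1.0, 0.1))
```

A tracking loop would adjust the trial delay until this difference, suitably scaled, balances the prior term.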

The dependence on intensity waveform can be further pursued by
investigating the Cramér-Rao bound (CRB) for delay estimation given in the Appendix.
For a given density $p(\tau)$, the CRB decreases as the time integral in (A-7)
increases. Using (5) this integral can be rewritten as

$$\int \frac{[dn(t)/dt]^2}{n(t)}\,dt = \int \left[\frac{dn(t)}{dt}\right]\left[\frac{d\log n(t)}{dt}\right]dt \tag{14}$$

where the integral is over all $t$ in $(0,T)$ for which $n(t) \ne 0$. By applying the
Schwarz inequality to the right integral, we note that (14) is maximized if

$$\frac{dn(t)}{dt} = \frac{d\log n(t)}{dt}, \qquad n(t) \ne 0 \tag{15}$$

in which case it becomes

$$\int_0^T \left(\frac{dn(t)}{dt}\right)^2 dt = \int_0^T \left(\frac{d\log n(t)}{dt}\right)^2 dt \tag{16}$$

Thus, the integral in (14) is bounded by the energy of the time derivative of
the transmitted intensity. By applying Parseval's theorem, we can further
write

$$\int_0^T \left(\frac{dn(t)}{dt}\right)^2 dt = \frac{1}{2\pi}\int_{-\infty}^{\infty} \omega^2\,|F_n(\omega)|^2\,d\omega \tag{17}$$


where $F_n(\omega)$ is the Fourier transform of $n(t)$ over one period. The integral
on the right can be interpreted as the mean squared frequency of the bandwidth
of the intensity. Thus, the CRB for delay estimation is minimized if a
transmitter intensity $n(t)$ is used that satisfies (15) and has the largest mean
square bandwidth in (17). The equality in (16) occurs only if $n(t) = \log n(t) +$
(constant) when $n(t) \ne 0$. This can be satisfied only if $n(t)$ is constant whenever
it is non-zero. Thus, (15) and (17) together suggest that best estimation
(minimal CRB) corresponds to flat intensities, with as wide a frequency
bandwidth as possible. The limit of such waveforms would be an ideal,
rectangular, narrow pulse in time, although theoretically (16) is not valid
for such intensities (the derivative of a pulse is not square integrable).
This pulsed intensity corresponds to transmission of a narrow burst of light
and, in spite of the analytical difficulties, we intuitively expect such optical
fields to indeed yield best delay estimation. (We may also note that the CRB
for the intensity pulse in Figure 3b is approximately $\epsilon / 2\mathcal{E}\log(\mathcal{E}/AD)$, which
decreases directly with $\epsilon$ and $D$, forcing the intensity to approach the ideal
rectangular pulse.) Even though the rectangular intensity is not differentiable,
the correlator-integrator in (6) retains its meaning as a short term integration
over the pulse width, starting at each value of $\tau$. This is often called a
"sliding window" integrator, and the delay point where the window maximizes
(6) is the MAP estimate. Unfortunately, this theoretically requires a search
over all values of $\tau$ in $(0,T)$, although this search time can often be reduced
by carrying out separate acquisition and tracking operations as discussed in
the next section.
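A sliding window integrator for a rectangular pulse can be sketched as an exhaustive search over trial delays, counting the photoelectrons that fall inside a window of one pulse width. The sketch assumes a flat prior over the search interval (so the prior term drops out), ignores wrap-around at the end of the period, and uses a hypothetical function name and grid resolution.

```python
def sliding_window_estimate(times, width, period, steps=1000):
    """Return (delay, count) for the window of length `width` holding the most counts."""
    best_tau, best_count = 0.0, -1
    for i in range(steps):
        tau = period * i / steps
        # Short term integration of the detector output over (tau, tau + width).
        count = sum(1 for t in times if tau <= t < tau + width)
        if count > best_count:  # ties keep the earliest delay
            best_tau, best_count = tau, count
    return best_tau, best_count
```

The cost grows with the number of trial delays, which is the search burden that separate acquisition and tracking are meant to reduce.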

When background noise is present, the counts are governed by the
Laguerre probabilities in (4). The estimation equations (6) and (7) for
Poisson counting must then be replaced by the discrete operations:

$$\max_{\tau}\left\{\sum_i \log L_{k_i} + \log p(\tau)\right\} \tag{18}$$

and

$$\frac{1}{p(\tau)}\frac{dp(\tau)}{d\tau} = -\sum_i k_i\,C(k_i,\tau)\left[\frac{d\log m_i(\tau)}{d\tau}\right] \tag{19}$$

where $k_i = k(t_i+\Delta t, t_i)$, $m_i(\tau) = m(t_i+\Delta t, t_i)$, $\Delta t$ is the counting interval
(reciprocal of the detector bandwidth), and

$$C(k_i,\tau) = 1 - \frac{L_{k_i-1}}{L_{k_i}}$$

with the Laguerre functions having argument $m_i(\tau)/N_0(1+N_0)$. The summations
represent modified forms of the correlation operations, and involve the
count sequence over $\Delta t$ sec intervals at the photodetector output.

Acquisition and Tracking in Pulse Delay Estimation 

Let us consider the delay estimation problem using ideal rectangular
pulses of width $D$, and let us write the delay $\tau$ in the form

$$\tau = jD + \tau_0, \qquad j = \text{integer}; \quad 0 \le \tau_0 \le D \tag{20}$$

We are here dividing the delay into an integer multiple of pulse widths plus
an additive excess portion $\tau_0$. We can now show that the MAP estimate of
$\tau$ can be obtained as $\hat{\tau} = \hat{j}D + \hat{\tau}_0$, that is, by simultaneously determining
MAP estimates of $j$ and $\tau_0$ and substituting into (20). This follows since
the joint MAP estimate of $j$ and $\tau_0$ must satisfy the simultaneous equations:

$$\frac{\partial p(j,\tau_0/x(t))}{\partial j} = 0, \qquad \frac{\partial p(j,\tau_0/x(t))}{\partial \tau_0} = 0 \tag{21}$$

where $p(j,\tau_0/x(t)) = p(\tau/x(t))$ with $\tau = jD + \tau_0$. On the other hand, the MAP
estimate of $\tau = jD + \tau_0$ satisfies $\partial p(\tau/k)/\partial\tau = 0$. However,

$$\frac{\partial p(\tau/k)}{\partial \tau} = \frac{\partial p(\tau/k)}{\partial j}\frac{dj}{d\tau} + \frac{\partial p(\tau/k)}{\partial \tau_0}\frac{d\tau_0}{d\tau} = \frac{\partial p(jD+\tau_0/k)}{\partial j}\cdot\frac{1}{D} + \frac{\partial p(jD+\tau_0/k)}{\partial \tau_0} = 0 \tag{22}$$

If $\hat{j}$ and $\hat{\tau}_0$ simultaneously satisfy (21), then (22) is also satisfied with
$\hat{\tau} = \hat{j}D + \hat{\tau}_0$. Thus, delay estimates $\hat{\tau}$ can be obtained by estimating individually
the number of pulse shifts $\hat{j}$ and the amount of excess, $\hat{\tau}_0$. The estimation of $j$
can be considered an acquisition problem (acquiring which interval the pulse
is in), while estimation of $\tau_0$ can be considered a tracking problem (tracking
the excess shifts within a pulse interval). In synchronization, the time
delay $\tau$ generally does not vary more than a pulse width from one observation
interval to the next. This suggests an alternative, suboptimal procedure in
which we first obtain a pure MAP estimate of $j$ alone in one interval, then
use $\hat{j}$ to estimate $\tau_0$ in the subsequent interval. The system achieves initial
acquisition first, then carries out tracking over later observation intervals.
The system is easier to implement and reduces search time, but we
emphasize that it generally does not yield the joint MAP estimates required
in (21).

To formulate the initial acquisition problem we model the observable as a
vector sequence $\mathbf{k}$ of counts $k_i$ over disjoint pulse widths $D$ in $(0,T)$. [This
is equivalent to considering the $h(t)$ functions in (1) as rectangular of width
$D$ and sampling the shot noise $x(t)$ every $D$ sec.] If we assume an initial
apriori joint density $p(j,\tau_0)$, then we can determine the MAP estimate of $j$
alone from

$$\max_j\, p(\mathbf{k}, j) = \max_j \int_0^D P(\mathbf{k}/j,\tau_0)\,p(j,\tau_0)\,d\tau_0 \tag{23}$$

For quantum limited operation we see that when conditioned on a particular
$j$ and $\tau_0$, the received rectangular pulse will influence only the $j$ and $j+1$
interval counts, all others producing zero counts. Thus,

$$P(\mathbf{k}/j,\tau_0) = \frac{[\mathcal{E}(1-\tau_0/D)]^{k_j}}{k_j!}\,e^{-\mathcal{E}(1-\tau_0/D)}\cdot\frac{(\mathcal{E}\tau_0/D)^{k_{j+1}}}{k_{j+1}!}\,e^{-(\mathcal{E}\tau_0/D)} = \frac{e^{-\mathcal{E}}}{k_j!\,k_{j+1}!}\left[\mathcal{E}\left(1-\frac{\tau_0}{D}\right)\right]^{k_j}\left[\frac{\mathcal{E}\tau_0}{D}\right]^{k_{j+1}} \tag{24}$$

where $\mathcal{E}$ is the received pulse energy. The MAP estimate of $j$ is that value
at which a maximum occurs in (23). Clearly, if we observe a count sequence
of which two are non-zero, (24) is maximum for the non-zero $k_j$ for any $\tau_0$
(i.e., $\hat{j}$ is the index of the first non-zero $k_i$). If only one count is non-zero
it can be labelled either by $k_j$ or $k_{j+1}$, and the MAP estimate is that producing
the maximum. Thus, if the $q$th count is non-zero, with value $k$, we must compare:

$$p(\mathbf{k}; j=q) = \frac{\mathcal{E}^k e^{-\mathcal{E}}}{k!}\int_0^D \left(1-\frac{\tau_0}{D}\right)^k p(j=q)\,p(\tau_0/q)\,d\tau_0 = p(j=q)\,\frac{\mathcal{E}^k e^{-\mathcal{E}}}{k!}\sum_{i=0}^{k}\binom{k}{i}\frac{(-1)^i\,m_i(q)}{D^i} \tag{25}$$

to

$$p(\mathbf{k}; j+1=q) = p(j=q-1)\,\frac{\mathcal{E}^k e^{-\mathcal{E}}}{k!}\int_0^D \left(\frac{\tau_0}{D}\right)^k p(\tau_0/q-1)\,d\tau_0 = p(j=q-1)\,\frac{\mathcal{E}^k e^{-\mathcal{E}}}{k!}\,\frac{m_k(q-1)}{D^k} \tag{26}$$

where $m_i(q)$ is the $i$th moment of the conditional density $p(\tau_0/j=q)$. Thus,
if only one count is non-zero the above moment sequences of the apriori
density $p(\tau_0/j)$ must be computed to determine initial MAP acquisition. If
we assume the most practical case where $p(j)$ is uniform over the integers,
and $p(\tau_0/j)$ is uniform over $(0, D)$ [initial delay is uniformly distributed over
$(0, T)$] then $m_i(q) = D^i/(i+1)$ for all $q$, and (25) and (26) take the same value,
each integral reducing to $1/(k+1)$. Thus, in the uniform case, we can equally likely select $q$ as $\hat{j}$ or
$\hat{j}+1$. If no counts are non-zero we can only estimate $j$ from its apriori density.

A 

Once j has been determined (initial acquisition achieved) in a particular 
observation interval, it can be used as the true j in subsequent observation 


intervals in which tracking (estimating T ) is accomplished. With j given. 


-13- 


the estimate is that value for which d In p(k/j, T^)/dT^ = 0, or that 


satisfying 


-L 




+ k? , 
T- j + 1 


(v°) 


The solution is then 


= 0 


(27) 


T 0 \ k 


k: , \ 

— 1 — ) D 

c? . + k? / 
J+ 1 J ' 


(28) 


Estimation of delay with rectangular pulses in quantum limited detection can therefore operate by first acquiring $\hat{j}$ during one observation period, then computing (28) in the next. The latter uses the observed count ratio as the fraction of the pulse width for the excess shift. As observations are made over subsequent intervals, (28) can be continually recomputed to keep track of changes in $\tau_0$. We emphasize that we have assumed that $j$ does not change throughout all intervals. If for some reason the delay jumps by several pulse positions, $j$ must be re-estimated and the delay reacquired.
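The count-ratio rule of (28) can be tried numerically. The following is a minimal sketch, assuming quantum limited Poisson counts whose means split the pulse energy in the proportions $(1-\tau_0/D)$ and $\tau_0/D$ between the two intervals. The function names, the Knuth-style Poisson sampler and the parameter values are illustrative choices, not part of the original report.

```python
import math
import random


def poisson(mean, rng):
    """Draw a Poisson variate by Knuth's multiplication method (adequate for small means)."""
    limit = math.exp(-mean)
    count, product = 0, rng.random()
    while product > limit:
        count += 1
        product *= rng.random()
    return count


def estimate_excess_delay(k_j, k_j1, pulse_width):
    """Count-ratio estimate in the spirit of (28); None when both counts are zero."""
    total = k_j + k_j1
    if total == 0:
        return None
    return pulse_width * k_j1 / total


def simulate(tau0, pulse_width, signal_count, trials, seed=0):
    """Average the estimate over many intervals with a fixed true excess delay tau0."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(trials):
        # Assumed model: the pulse energy divides between intervals j and j+1.
        k_j = poisson(signal_count * (1.0 - tau0 / pulse_width), rng)
        k_j1 = poisson(signal_count * tau0 / pulse_width, rng)
        estimate = estimate_excess_delay(k_j, k_j1, pulse_width)
        if estimate is not None:
            estimates.append(estimate)
    return sum(estimates) / len(estimates)


if __name__ == "__main__":
    print(simulate(tau0=0.3, pulse_width=1.0, signal_count=20.0, trials=4000))
```

Conditioned on a non-zero total count, the second count is binomially distributed with parameter $\tau_0/D$, so the averaged estimate should settle near the true excess delay.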

The variance of the above estimator is difficult to determine explicitly, since $\hat{\tau}_0$ involves a ratio of random counts. In addition, the CRB is hampered by the non-differentiability of the pulsed intensities. However, a variance upper bound on $\hat{\tau}_0$ can be determined by noting that $\text{Var}\,\hat{\tau}_0 \le D^2$. Furthermore, if all counts are zero, the variance is at most that of the a priori density on $\tau_0$, $\sigma_\tau^2$, if we use the mean as the delay estimate. Thus,

$$\text{Variance} = \sigma_\tau^2\, [\text{Prob } \mathbf{k} = 0] + (\text{Var}\,\hat{\tau}_0)\, [\text{Prob } \mathbf{k} \ne 0] \le \sigma_\tau^2 e^{-K_s} + D^2 (1 - e^{-K_s}) \tag{29}$$

This shows the estimator variance is reduced to no more than the square of the pulse width $D$ as the pulse energy $K_s \to \infty$.
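As a quick numerical check of the bound in (29), the sketch below evaluates it for a few pulse energies. It assumes the probability of an all-zero count vector is $e^{-K_s}$, as in the quantum limited case, and the function and argument names are hypothetical.

```python
import math


def variance_bound(prior_variance, pulse_width, signal_count):
    """Upper bound of (29): prior variance when no counts occur, D^2 otherwise."""
    p_zero = math.exp(-signal_count)  # assumed probability that every count is zero
    return prior_variance * p_zero + pulse_width ** 2 * (1.0 - p_zero)


if __name__ == "__main__":
    for energy in (0.0, 1.0, 5.0, 50.0):
        print(energy, variance_bound(prior_variance=0.5, pulse_width=2.0, signal_count=energy))
```

At zero pulse energy the bound equals the prior variance, and for large energy it approaches $D^2$, which matches the limiting statement in the text.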

When background noise is present, initial acquisition is more complicated, since the non-signal intervals produce noise counts also. In this case (24) is replaced by

$$P(\mathbf{k}/j, \tau_0) = \frac{e^{-K_s/(1+N_0)}}{1+N_0} \left(\frac{N_0}{1+N_0}\right)^{k} L_{k_j}(A)\, L_{k_{j+1}}(B) \tag{30}$$

where $k = \sum_i k_i$, $A = K_s(1-\tau_0/D)/N_0(1+N_0)$ and $B = K_s\tau_0/DN_0(1+N_0)$. For a given count sequence $\mathbf{k}$ over a particular interval, we must determine the $j$ maximizing (23), which is equivalent to determining

$$\max_j\; p(j) \int_0^D L_{k_j}(A)\, L_{k_{j+1}}(B)\, p(\tau_0/j)\, d\tau_0 \tag{31}$$


Unfortunately, this maximization must be found after integration over $\tau_0$. However, we note that in comparing two different pairs of indices, say $(j_1, j_2)$ and $(j_3, j_4)$, maximization of (31) is equivalent to comparing

$$\frac{p(\mathbf{k}, j_1)}{p(\mathbf{k}, j_3)} = \frac{\int_0^D L_{k_{j_1}}(A)\, L_{k_{j_2}}(B)\, p(\tau_0/j_1)\, d\tau_0}{\int_0^D L_{k_{j_3}}(A)\, L_{k_{j_4}}(B)\, p(\tau_0/j_3)\, d\tau_0} \tag{32}$$

when each $j$ is equally likely. We now see that for any density, if $k_{j_1} > k_{j_3}$ and $k_{j_2} > k_{j_4}$, then (32) exceeds one, due to the positiveness and monotonicity of Laguerre functions with their indices. Thus, if any pair of successive counts are each greater than the corresponding members of any other pair of counts, the optimal estimate of $j$ is always the index of the first of the larger pair. If no one pair dominates any other pair in this way, then one must resort to integrating first in (31). When $p(\tau_0/j)$ does not depend on $j$, and is uniformly distributed over $(0, D)$, the integration in (31) can be performed, using the identity:



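$$\int_0^y L_m(y-x)\, L_n(x)\, dx = L^{(-1)}_{m+n+1}(y) \tag{33}$$

After substituting, and using again the monotonicity of the Laguerre functions, (31) becomes

$$\max_j \left\{ L^{(-1)}_{k_j+k_{j+1}+1}\!\left[\frac{K_s}{N_0(1+N_0)}\right] \right\} = \max_j\, \{k_j + k_{j+1}\} \tag{34}$$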


Thus, $\hat{j}$ is the index of the pair of consecutive counts having the largest sum, and initial acquisition is achieved by determining the maximal consecutive count pair.

Lastly, we point out that the well accepted procedure of basing initial acquisition on the largest of the counts (selecting $\hat{j}$ as that $j$ for which $k_j$ is maximum) is equivalent to an assumption that $\tau_0 = 0$. For then $B = 0$ and $A$ does not depend on $\tau_0$ in (31), and maximization over $j$ is equivalent to maximization over $k_j$.
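The two acquisition rules discussed above can be compared on a sample count sequence. This is an illustrative sketch only: the function names and the example counts are invented for the comparison, and ties are resolved in favor of the earliest index.

```python
def acquire_by_pair_sum(counts):
    """Index of the first member of the consecutive count pair with the largest sum."""
    pair_sums = [counts[j] + counts[j + 1] for j in range(len(counts) - 1)]
    return max(range(len(pair_sums)), key=pair_sums.__getitem__)


def acquire_by_largest_count(counts):
    """Index of the single largest count (the rule that implicitly assumes zero excess delay)."""
    return max(range(len(counts)), key=counts.__getitem__)


if __name__ == "__main__":
    counts = [1, 0, 4, 5, 1, 6, 0]  # hypothetical observed counts, one per pulse slot
    print(acquire_by_pair_sum(counts))       # pair (4, 5) has the largest sum
    print(acquire_by_largest_count(counts))  # the isolated count of 6 wins instead
```

With these counts the two rules disagree: an isolated large noise count captures the largest-count rule, while the pair-sum rule favors the two adjacent slots that share the pulse.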

Delay Tracking in PPM Digital Systems 

A problem closely related to pulse delay estimation in synchronization occurs when considering the tracking of pulse shifts in an optical PPM system. In this operation an optical pulse $D$ sec wide is sent in one of $M$ possible $D$ sec time intervals, and a random time shift is added during transmission, independent of which pulse position is used. This added shift will cause PPM detection errors if not compensated [5]. A sync subsystem of the receiver attempts to measure the added shift during each word interval for proper receiver compensation. This measurement must be made, however, without regard to the pulse position modulation. Thus, during each word interval the transmitted pulse arrives with a total delay $\tau = jD + \tau_0$ as before, where $j$ is the integer position due to the modulation and $\tau_0$ is the added excess delay during transmission. The tracking problem can be formulated as one of estimating $\tau_0$ in the presence of the parameter $j$. Because of the position modulation, $j$ must be considered independent from one observation interval to the next, and estimates of $j$ in one interval cannot be used in subsequent intervals. Thus, during each observation of $\mathbf{k}$, $\tau_0$ must be re-estimated in the presence of $j$. The resulting MAP tracking system for estimating $\tau_0$ depends upon the manner in which the index $j$ is modeled. If $j$ is considered an unknown parameter (no a priori density specified), then the maximization over $\tau_0$ must take into account all the possible values that $j$ can take on. Thus, $\hat{\tau}_0$ is the value for which

$$\max_{\tau_0}\, p(\tau_0/\mathbf{k}) = \max_j \left[ \max_{\tau_0}\, p(\tau_0/\mathbf{k}, j) \right] \tag{35}$$


This is equivalent to determining simultaneous maximizing values of $\tau_0$ and $j$, and therefore corresponds to simultaneous estimates of these parameters. In other words, the MAP tracker must estimate both parameters even though only the estimate of $\tau_0$ is of interest. Furthermore, both estimates must be obtained during each observation, and cannot be subdivided into acquisition and tracking, if real time solutions are desired.

If a delay of one word interval is acceptable, a suboptimal tracking procedure would be one that first estimates $j$ during the original observation, stores the observation (detector output) for one word length, then reuses the stored observables, along with the estimate $\hat{j}$, to determine $\hat{\tau}_0$, as shown in Figure 5. The estimate of $j$ can be made using techniques similar to initial acquisition in synchronization. The tracking system is therefore attempting to first detect which interval contains the pulse (i.e., decode the PPM word), then use the decoded word to estimate $\tau_0$. In the literature, this is referred to as decision-directed estimation [2] and the resulting sync systems are called data-aided trackers [6, 7].

If the word delay in data-aided systems is prohibited, an alternative scheme is that shown in Figure 6. Here estimates of $\tau_0$ are made consecutively with each successive pair of observed counts, and stored until the end of the observation interval. The estimate of $j$ is then used to select the $\hat{\tau}_0$ corresponding to the most likely $j$. This operation avoids the word interval delay, but requires a bank of estimators. Both these systems are of course suboptimal, since they do not necessarily produce the simultaneous maximization required in (35).
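The bank-of-estimators scheme just described can be sketched in a few lines. The sketch assumes quantum limited counts, uses the count-ratio rule for each successive pair, and selects the pair with the largest sum as the estimate of $j$. These choices borrow from the acquisition discussion earlier in the report, and the function name and sample data are hypothetical.

```python
def track_excess_delay(counts, pulse_width):
    """Return (j_hat, tau_hat): one delay estimate per count pair, then pick the likeliest pair."""
    bank = []
    for j in range(len(counts) - 1):
        total = counts[j] + counts[j + 1]
        # Each bank element stores its own count-ratio estimate until the word ends.
        estimate = pulse_width * counts[j + 1] / total if total > 0 else None
        bank.append((total, j, estimate))
    # Assumed selection rule: the pair with the largest sum marks the pulse position.
    _, j_hat, tau_hat = max(bank, key=lambda entry: entry[0])
    return j_hat, tau_hat


if __name__ == "__main__":
    word_counts = [0, 0, 3, 1, 0, 0]  # hypothetical counts over one PPM word
    print(track_excess_delay(word_counts, pulse_width=1.0))
```

Because every pair estimate is formed as the counts arrive, only the final selection waits for the end of the word, which is the trade the text describes between delay and hardware.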


If, instead of treating $j$ as an unknown parameter, we model it as a random variable taking on the values $1, 2, 3, \ldots, M$ with equal probability, the MAP estimate of $\tau_0$ can be obtained by averaging over these $j$ values. Hence we write

$$\max_{\tau_0}\, p(\tau_0/\mathbf{k}) = \max_{\tau_0} \left[ \sum_{j=1}^{M} p(\tau_0/\mathbf{k}, j) \right] \tag{36}$$


Since each term $p(\tau_0/\mathbf{k}, j)$ is the conditional density of $\tau_0$ when the pulse is transmitted in the $j^{th}$ position, only the $k_j$ and $k_{j+1}$ counts are necessary to estimate $\tau_0$. (All other counts are either zero, in the quantum limited case, or contain only noise counts, when background is present.) Hence, $p(\tau_0/\mathbf{k}, j)$ can theoretically be computed immediately after $k_j$ and $k_{j+1}$ are observed. The summation in (36) is therefore a superposition of all such a posteriori densities, each delayed until the end of the observation interval. The estimate $\hat{\tau}_0$ is then made from this superposition. The system is shown in Figure 7. Note that the delaying of the a posteriori densities can be considered as modulation removal (eliminating the position shift due to PPM and shifting the excess delay to the end of the interval, where the estimate is made). Note that this latter estimate is not simply the average of the individual MAP estimates at each value of $j$. If it is known that $\tau_0$ is confined to a narrow region about each pulse position, then (36) is approximately

$$\max_{\tau_0}\, \{p(\tau_0/\mathbf{k})\} \approx \max_{\tau_0}\, \{p(\tau_0/\mathbf{k}, j_{max})\} \tag{37}$$

where $j_{max}$ is the $j$ maximizing $p(\tau_0/j, \mathbf{k})$ over all $\tau_0$. The last term is identical to the simultaneous estimate of $j$ and $\tau_0$, and therefore corresponds to the optimal MAP tracker defined in (35).


REFERENCES 


[1] Van Trees, H., Detection, Estimation, and Modulation Theory, Part 1 (book), Wiley, Inc., 1968.

[2] Hancock, J. and Wintz, P., Signal Detection Theory (book), McGraw-Hill Book Company, 1966, Chapter 5.

[3] Viterbi, A., Principles of Coherent Communication (book), McGraw-Hill Book Co., 1966, Chapter 5.

[4] Gagliardi, R. and Mohanty, N., "Estimation Theory and Optical Communications," USCEE Report 446, April, 1973.

[5] Gagliardi, R., "The Effect of Timing Errors in Optical Digital Systems," IEEE Trans. on Comm. Tech., Vol. COM-20, April, 1972.

[6] Lindsey, W. and Simon, M., "Data-Aided Carrier Tracking Loops," IEEE Trans. on Comm. Tech., Vol. COM-19, pp. 157, April, 1971.

[7] Gagliardi, R., "Synchronization Using Pulsed Edge Tracking in Optical Systems," USCEE Report 426, September, 1972.


APPENDIX 


Let $\mathbf{k}$ be an observable vector containing a real random parameter $\theta$, and let $p(\theta)$ be an a priori probability density on $\theta$. The MAP estimate of $\theta$, given an observable $\mathbf{k}$, is that $\hat{\theta}$ maximizing

$$\log p(\theta, \mathbf{k}) = \log p(\mathbf{k}/\theta) + \log p(\theta) \tag{A-1}$$

where $p(\mathbf{k}/\theta)$ is the conditional density of $\mathbf{k}$ given the parameter $\theta$. In optical systems, $\mathbf{k}$ represents the sequence of observed photoelectron counts $(k_1, k_2, \ldots)$, each observed over a $\Delta t$ sec counting interval ($\Delta t$ = 1/detector bandwidth). Under quantum limited operation, the conditional density is

$$p(\mathbf{k}/\theta) = \prod_i\, [s_i(\theta)]^{k_i} \exp[-s_i(\theta)]/k_i! \tag{A-2}$$

where $s_i(\theta)$ is the count parameter over the interval $(t_i, t_i+\Delta t)$:

$$s_i(\theta) = \int_{t_i}^{t_i+\Delta t} n(t, \theta)\, dt \tag{A-3}$$

and $n(t, \theta)$ is the count intensity. The MAP estimate of $\theta$ in (A-1) is that achieving

$$\max_\theta \left\{ \sum_i\, [k_i \log s_i(\theta) - s_i(\theta)] + \log p(\theta) \right\} \tag{A-4a}$$

The solution $\hat{\theta}$ must also satisfy the extremal condition:

$$\left[ \sum_i \left( k_i \frac{s_i'(\theta)}{s_i(\theta)} - s_i'(\theta) \right) + \frac{p'(\theta)}{p(\theta)} \right]_{\theta=\hat{\theta}} = 0 \tag{A-4b}$$


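where the primes denote derivatives with respect to $\theta$. As $\Delta t \to 0$, the continuous versions of these equations can be obtained, since $s_i(\theta) \to n(t, \theta)\,dt$ and $k_i \to x(t)$, the detector shot noise process. Hence (A-4) becomes

$$\max_\theta \left[ \int_0^T x(t) \log n(t, \theta)\, dt + \log p(\theta) - \int_0^T n(t, \theta)\, dt \right] \tag{A-5a}$$

and

$$\int_0^T x(t)\, \frac{n'(t, \hat{\theta})}{n(t, \hat{\theta})}\, dt + \frac{p'(\hat{\theta})}{p(\hat{\theta})} - \int_0^T n'(t, \hat{\theta})\, dt = 0 \tag{A-5b}$$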


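The Cramer-Rao Bound lower bounds the mean square error of the MAP estimate, and is given by

$$\text{CRB} = \left\{ -E\left[ \frac{\partial^2 \log[p(\mathbf{k}/\theta)\,p(\theta)]}{\partial \theta^2} \right] \right\}^{-1} \tag{A-6}$$

where $E$ is the expectation operator over $\mathbf{k}$ and $\theta$. Using (A-2) in (A-6) and averaging yields

$$\text{CRB} = \left\{ E_\theta\left[ -\frac{\partial^2 \log p(\theta)}{\partial \theta^2} + \int_0^T \frac{[n'(t, \theta)]^2}{n(t, \theta)}\, dt \right] \right\}^{-1} \tag{A-7}$$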
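The discrete MAP rule of the Appendix, maximizing $\sum_i [k_i \log s_i(\theta) - s_i(\theta)] + \log p(\theta)$, can be illustrated with a simple grid search. The sketch below assumes a smooth Gaussian-shaped intensity pulse with delay $\theta$ on a constant background, a flat prior, and the midpoint approximation $s_i(\theta) \approx n(t_i, \theta)\Delta t$. The pulse shape, parameter names and values are illustrative assumptions, not taken from the report.

```python
import math


def interval_means(theta, n_intervals, dt, peak_rate, width, background):
    """Approximate count parameters s_i(theta) by intensity at the interval midpoint times dt."""
    means = []
    for i in range(n_intervals):
        t_mid = (i + 0.5) * dt
        rate = background + peak_rate * math.exp(-0.5 * ((t_mid - theta) / width) ** 2)
        means.append(rate * dt)
    return means


def map_objective(counts, means, log_prior=0.0):
    """Poisson log-likelihood terms that depend on theta, plus the log prior."""
    return sum(k * math.log(s) - s for k, s in zip(counts, means)) + log_prior


def map_estimate(counts, thetas, **model):
    """Grid search for the theta maximizing the objective under a flat prior."""
    return max(
        thetas,
        key=lambda th: map_objective(counts, interval_means(th, len(counts), **model)),
    )


if __name__ == "__main__":
    model = dict(dt=0.1, peak_rate=50.0, width=0.3, background=1.0)
    # Noise-free stand-in for the counts: the expected counts at a true delay of 2.0.
    counts = interval_means(2.0, 40, **model)
    grid = [0.1 * i for i in range(41)]
    print(map_estimate(counts, grid, **model))
```

Using the expected counts in place of random counts makes the example deterministic: the objective is then maximized exactly at the true delay, which is a convenient sanity check before feeding in simulated Poisson data.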




April 1974


USCEE Report 471


NONCOHERENT DETECTION OF PERIODIC SIGNALS

R. Gagliardi

Interim Technical Report

Department of Electrical Engineering
University of Southern California
Los Angeles, California 90007

New Title: "Noncoherent Detection of Periodic Optical Pulses"
by R. Gagliardi


This work was sponsored by the National Aeronautics and Space
Administration, under NASA Contract NGR-05-018-104. This grant
is part of the research program at NASA's Goddard Space Flight
Center, Greenbelt, Maryland.



Introduction 


Various modulation techniques are presently under study for communi- 
cating digital information over an optical channel. The most common 
method is by the use of pulse position modulation (PPM) in which digital 
words are transmitted as narrow optical pulses properly located within 
a data frame. Such systems however are hampered by the requirement 
to maintain a close tolerance on timing and synchronization in order to 
perform detection over the narrow pulses. An alternative encoding scheme 
that avoids the short pulse timing problem is by the use of coded frequency 
division modulation (FDM). In this case information is sent as frequencies, 
rather than pulse positions, and the synchronization problem is relaxed. 

One possible implementation scheme is to transmit the digital words as bursts of square waves of different frequencies, where the length of the square wave is selected to generate sufficient energy levels for detection. The encoded square wave is used to intensity modulate the optical beam. (A square wave is used rather than a sine wave because it has maximum baseband energy in a finite time for a fixed power constraint on the optical transmitter.) Following direct (non-coherent) optical detection in the photo detector, the subcarrier square wave is detected (a decision is made as to which square wave frequency is being received) in order to decode the digital word. The timing need be maintained only to within the length of the square wave signal, which is many times the length of an optical pulse in a PPM system.


It is desired to implement the optimal detector for the set of square waves. Although the bit timing problem has been considerably reduced, there still exists a time referencing problem, since the square waves will be received with random delays. Hence, coherent correlation techniques cannot be used, and the optimal noncoherent FDM square wave detector is required. Unfortunately, noncoherent detectors for waveforms that are not narrowband are not known, even for the classical additive Gaussian noise channel. In this report we present the results of an initial study to derive the optimal noncoherent detector for an arbitrary periodic waveform not necessarily of the narrowband type, e.g., square waves. Attention is confined to only an additive Gaussian noise channel. The latter model is valid in an optical system when strong optical fields are detected. Future work will extend the results to the low power optical (Poisson) channel.


Analysis 


Classical non-coherent detection is generally understood to be the detection of a sine wave with random phase or time delay in additive Gaussian noise. The problem is well documented in communication texts, and the Bayes optimal detector has been derived as both a matched envelope detector and a quadrature correlator-squaring device. These results have been expanded to include narrowband bandpass signals as well [1]. However, the extension to a general non-coherent problem involving the detection of an arbitrary periodic signal with random time delay has received little attention. The closest documentation appears in the radar literature, where the problem is formulated as non-coherent detection of periodic RF pulses [2], but in all cases the narrowband assumption is imposed in order to derive an interpretable solution. Admittedly, the general non-coherent problem may not be of great practical interest because of the bandwidths required to transmit all harmonics. Also, perhaps, the complexity of the general solution may have discouraged academic pursuit. Nonetheless, in this paper the general non-coherent problem is re-examined with the objective of interpreting the processing required by the optimal detector.

Let $p(t)$ be a general periodic, deterministic signal having period $t_0$ and bounded energy. The signal is observed for $T$ seconds with a random delay $\tau$ in the presence of additive white Gaussian noise $n(t)$. The observation time $T$ will be taken as an integer multiple of $t_0$ for convenience, although our results become an accurate approximation if $T \gg t_0$. The observable can therefore be written

$$v(t) = p(t-\tau) + n(t), \qquad t \in (0, T) \tag{1}$$


For the non-coherent problem we assume $\tau$ is uniformly distributed over $(0, t_0)$. The optimal (Bayes) detector for the signal is desired. Mathematically, the Bayes detector is that which computes the generalized likelihood ratio $\Lambda$ obtained by averaging over $\tau$. For the observable of (1) this becomes

$$\Lambda = \frac{C}{t_0} \int_0^{t_0} \exp\left[ \frac{2}{N_0} \int_0^T v(t)\, p(t-\tau)\, dt \right] d\tau \tag{2}$$

where $N_0$ is the one-sided noise level and $C$ depends upon $v(t)$ but not on $\tau$. Since $C$ can be computed without use of $p(t)$, it is brought along simply as a constant in subsequent equations. This property of $C$ also requires our assumption concerning the relation of observation time and signal period. Since $p(t)$ is periodic, it admits a Fourier expansion which allows its delayed version to be written as

$$p(t-\tau) = \sum_{k=0}^{\infty} a_k \cos\left( \frac{2\pi k t}{t_0} + \psi_k - k\theta \right) \tag{3}$$

where $(a_k, \psi_k)$ are the harmonic amplitudes and phases of $p(t)$, and $\theta = 2\pi\tau/t_0$ is the uniformly distributed phase variable over $(0, 2\pi)$. The delay $\tau$ therefore introduces a random phase to each harmonic of $p(t)$, but note that these phases are related as rational multiples of each other. Using (3) in (2), and manipulating trigonometrically, yields

$$\Lambda = \frac{C}{2\pi} \int_0^{2\pi} \exp\left[ \sum_{k=0}^{\infty} E_k \cos(k\theta + \varphi_k) \right] d\theta \tag{4}$$

where

$$X_k = \frac{2a_k}{N_0} \int_0^T v(t) \cos\left( \frac{2\pi k t}{t_0} + \psi_k \right) dt \tag{5a}$$

$$Y_k = -\frac{2a_k}{N_0} \int_0^T v(t) \sin\left( \frac{2\pi k t}{t_0} + \psi_k \right) dt \tag{5b}$$

$$E_k = \left[ X_k^2 + Y_k^2 \right]^{1/2} \tag{5c}$$

$$\varphi_k = \tan^{-1}[Y_k/X_k] \tag{5d}$$

Here $(X_k, Y_k)$ are the in phase and quadrature harmonic correlations, and $(E_k, \varphi_k)$ are the corresponding harmonic envelope and phase variables.

Unfortunately, (4) does not appear to integrate to an immediately obvious system implementation. In particular, it does not collapse down to a simple in phase and quadrature correlation with $p(t)$ and $p[t-(t_0/2)]$, as might be conjectured from the well known bandpass case. The latter correlator would develop only if $\sin\theta$ or $\cos\theta$ terms factored out of every term in the exponent of (4). That this factorization does not occur in general is simply a reiteration of the fact that a single sine wave is the only periodic function satisfying the condition that shifted versions of itself are always uniquely decomposable into in phase and quadrature components.


Nevertheless, several analytical procedures are possible to reduce (4). One is to define the random variable

$$z(\theta) = \sum_{k=0}^{\infty} E_k \cos(k\theta + \varphi_k) \tag{6}$$

and to note that $\Lambda/C$ is the characteristic function of $z$ evaluated at $j\omega = 1$. Unfortunately, $z$ is a sum of dependent random sinusoidal variables, and its probability density is not easily computed. A more fruitful procedure is to derive an infinite series solution by using the expansion

$$e^{a \cos\beta} = \sum_{m=0}^{\infty} \epsilon_m I_m(a) \cos m\beta \tag{7}$$

where $\epsilon_m$ is the Neumann parameter ($\epsilon_0 = 1$, $\epsilon_m = 2$ for $m \ge 1$) and $I_m(a)$ is the $m^{th}$ order imaginary Bessel function. When used in (4), the latter expands to

$$\Lambda = \frac{C}{2\pi} \sum_{\mathbf{m}} \left[ \prod_i I_{|m_i|}(E_i) \right] \int_0^{2\pi} \cos\left[ \sum_i (m_i i\theta + m_i \varphi_i) \right] d\theta \tag{8}$$

where $\mathbf{m} = \{m_1, m_2, \ldots\}$ is the vector of integer coefficients $m_i$, $m_i \in (-\infty, \infty)$. Each vector $\mathbf{m}$ produces a different harmonic in the integrand. However, each such harmonic will integrate to zero in (8), except for those in which

$$\sum_i i\, m_i = 0 \tag{9}$$


This reduces (8) to

$$\Lambda = C \sum_{\mathbf{m}(0)} \left[ \prod_i I_{|m_i|}(E_i) \right] \cos\left[ \sum_i m_i \varphi_i \right] \tag{10}$$

where $\mathbf{m}(0)$ is the set of integer vectors whose components satisfy (9). The optimal detector therefore involves a search and summation over an infinite number of integer vectors. Note that the detector makes use of the envelope of each harmonic of $p(t)$, but processes it in a rather complicated way. At this point, all that can be concluded is that the general detector involves a bank of matched envelope detectors producing $\{E_i\}$ and $\{\varphi_i\}$, followed by a complicated computer processor that instantaneously computes (10). Furthermore, the Bessel functions must be evaluated, unless one appeals to high and low signal-to-noise ratio arguments to substitute limiting forms.

Let us examine the implications of (10). Theoretically, one wonders why the optimal detector utilizes such complex processing for detection. If the harmonic random phase angles in (3) had been statistically independent of each other (i.e., $\{k\theta\}$ replaced by $\{\theta_k\}$, where the latter is an independent, uniform sequence) then the $\Lambda$ obtained by averaging over the sequence of phase angles would be

$$\Lambda = C \prod_{i=1}^{\infty} I_0(E_i) \tag{11}$$

as previously reported [3]. We see that this is one term of the sum in (10). Thus the remaining terms of the sum must be taking advantage of the integer phase relation between the random phase angles. From a practical point of view, one may also inquire if any type of physically realizable system can produce (10), precluding the use of infinitely fast computers.

A partial answer to those inquiries can be obtained by noting that (10) is reminiscent of the intermodulation terms arising when a sum of carriers is passed through a nonlinearity [4]. In fact, (10) is proportional to the average, or "d.c.", value of the output of the nonlinearity $e^x$ when impressed with the input

$$x(t) = \sum_n E_n \cos(nt + \varphi_n) \tag{12}$$

That is, if $y(t) = C \exp[x(t)]$, then since $x(t)$ in (12) is periodic with period $2\pi$,

$$\text{Time average of } y(t) = \lim_{T\to\infty} \frac{1}{2T} \int_{-T}^{T} C \exp[x(t)]\, dt = \frac{C}{2\pi} \int_0^{2\pi} \exp[x(t)]\, dt \tag{13}$$

which is identical to the desired $\Lambda$ in (4). The terms in (10) involve precisely those output harmonic terms that contribute (beat down) to this average value. The optimal processing implied is therefore used to take advantage of the phase relation among the harmonics, making use of all beat frequencies that contain useful information for detection. In the independent phase case of (11), the harmonics are not phase related and the available beat frequencies do not aid detection, on the average. Hence, only the zero order component is used. Note that the processing is not simply angle shifting each harmonic of $p(t)$ so as to overlap in time, but instead using the nonlinearity to intentionally generate all possible beat frequencies that cause harmonic overlap.



Equation (13) also suggests a method of implementation. The receiver must generate $x(t)$ of (12), then pass it through the nonlinearity $e^x$, followed by averaging (low pass filtering), as shown in Figure 1. The processor generating $x(t)$ involves determination of $\{X_k, Y_k\}$ from $v(t)$, according to (5), then adjusting the amplitude and phase of harmonically locked oscillators, as shown in Figure 2. The computation of $X_k$ and $Y_k$ involves in phase and quadrature harmonic correlation over the $T$ sec observation interval. The overall processor would then be a bank of such harmonic subsystems, one for each signal harmonic. Since the averaging implied in (13) must be done after these correlations, Figure 1 may be interpreted as a non-real time implementation. The processor in Figure 1 can also be interpreted by comparing (12) to (6), and noting that

$$x(t) = z(\theta)\big|_{\theta=t} \tag{14}$$

However, $z(\theta)$ is also the exponent in (2), with $\tau = t_0\theta/2\pi$. Thus

$$x(t) = \frac{2}{N_0} \int_0^T v(\rho)\, p\left( \rho - \frac{t_0 t}{2\pi} \right) d\rho \tag{15}$$


When written as above, the processor output $x(t)$ is the output of a filter at the normalized time $t(t_0/2\pi)$, when the input is $v(t)$ and the filter impulse response is $p(-t)$, $t \in (0, T)$. This is simply a matched filter for the periodic signal $p(t)$, but the filter is non-causal since $p(t)$ is not zero for negative $t$. [The non-causality is indicative of the fact that all the observable over $(0, T)$ is used to generate $x(t)$ at any $t$ within $(0, T)$.] The non-causality implies again the non-real time implementation required for Figure 1. It is interesting that a particular non-linearity (exponential) is specified by the Bayes detector.

[Figure 1. Input $v(t)$, filter producing $x(t)$, nonlinearity $Ce^{x(t)}$, and integrator.]

[Figure 2. Harmonic subsystem with amplitude and phase controls.]

The extension to non-uniform densities on the delay $\tau$ can be easily accounted for in Figure 1. A non-uniform density, $a(\theta)$, in the integrand of (4) would convert to a correlation rather than an integration in (13). The detector in this case would simply replace the low pass filter following the non-linearity by a correlator of $y(t)$ and $a(t)$ over the $2\pi$ sec interval. The receiver would therefore be required to locally generate this probability density as a function of $t$.

It may be of interest to further examine why in phase-quadrature (I-Q) correlation is not the optimal processor. The I-Q detector for an arbitrary periodic $p(t)$ is shown in Figure 3. The input $v(t)$ is simultaneously correlated for $T$ sec with $p(t)$ and $p(t - t_0/2)$, and the outputs are squared and summed. Consider the behavior of the system when only the signal portion of $v(t)$ [i.e., $p(t-\tau)$] is impressed at the input. The output of the in phase correlator is

$$X = \int_0^T p(t-\tau)\, p(t)\, dt = R_{pp}(\tau) \tag{16}$$

where $R_{pp}(\tau)$ is the correlation function of $p(t)$ evaluated at the point $\tau$. Similarly, the quadrature correlator produces

$$Y = \int_0^T p(t-\tau)\, \hat{p}(t)\, dt = R_{p\hat{p}}(\tau) \tag{17}$$

where $\hat{p}(t)$ is the shifted version of $p(t)$. Since $p(t)$ is periodic, $\hat{p}(t)$ is also the Hilbert transform of $p(t)$. From a well known property of such transforms [5],

$$R_{p\hat{p}}(\tau) = \hat{R}_{pp}(\tau) \tag{18}$$

Defining the complex correlation pre-envelope process $\Phi(\tau) = R_{pp}(\tau) + j\hat{R}_{pp}(\tau)$ allows us to express the I-Q correlator output as

$$q = R_{pp}^2(\tau) + \hat{R}_{pp}^2(\tau) = |\Phi(\tau)|^2 \tag{19}$$

Since $\Phi(\tau)$ is a pre-envelope process, its magnitude equals $\sqrt{2}$ times the magnitude of its real part [1, p. 80]. Hence, we write $q$ in (19) as

$$q = 2|R_{pp}(\tau)|^2 \tag{20}$$

[Figure 3. I-Q detector for an arbitrary periodic $p(t)$.]


Thus, in the noiseless case the I-Q detector always produces an output equivalent to sampling the squared correlation envelope at the delay $\tau$. Since this $\tau$ is random, it would be expected that a useful detection system should not depend on $\tau$. The output of the I-Q detector will not depend on $\tau$ only if the envelope of the correlation function of $p(t)$ does not depend on $\tau$. For a pure sine wave the correlation function is a cosine wave and its envelope is indeed constant. For a narrowband bandpass $p(t)$ the envelope is approximately constant over the range of $\tau$ [i.e., $\tau \in (0, t_0)$ and $t_0 \ll$ envelope variations]. For both of these examples the I-Q detector is in fact optimal. However, for the general periodic function, $q$ in (19) will depend on $\tau$, and I-Q correlation is not a plausible detector.
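The dependence of the I-Q output on the delay can be seen numerically from the harmonic amplitudes alone. The sketch below assumes a time-averaged correlation function, $R_{pp}(\tau) = \sum_k (a_k^2/2)\cos k\theta$ with Hilbert transform $\sum_k (a_k^2/2)\sin k\theta$, where $\theta = 2\pi\tau/t_0$, and forms $q$ as in (19). The normalization, the truncation of the square wave to nine harmonics, and the function names are assumptions made for illustration.

```python
import math


def iq_output(amplitudes, theta):
    """q = R^2 + R_hat^2 from harmonic amplitudes a_1, a_2, ...; theta = 2*pi*tau/t0."""
    r = sum(0.5 * a * a * math.cos(k * theta) for k, a in enumerate(amplitudes, start=1))
    r_hat = sum(0.5 * a * a * math.sin(k * theta) for k, a in enumerate(amplitudes, start=1))
    return r * r + r_hat * r_hat


def square_wave_amplitudes(n_harmonics):
    """Fourier amplitudes of a unit square wave: 4/(pi*k) for odd k, zero for even k."""
    return [4.0 / (math.pi * k) if k % 2 == 1 else 0.0 for k in range(1, n_harmonics + 1)]


if __name__ == "__main__":
    sine = [1.0]
    square = square_wave_amplitudes(9)
    for theta in (0.0, math.pi / 4, math.pi / 2):
        print(theta, iq_output(sine, theta), iq_output(square, theta))
```

The sine wave column stays constant for every delay, while the square wave column changes with $\theta$, which is the behavior the text gives as the reason I-Q correlation is unsuitable for a general periodic signal.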


REFERENCES 


[1] L. E. Franks, "Signal Theory" (book), Prentice-Hall, Inc., 1969, Chapter 10.

[2] L. Wainstein, V. Zubakov, "Extraction of Signals from Noise" (book), Prentice-Hall, Inc., 1962, Chapter 6.

[3] L. Wainstein, V. Zubakov, "Extraction of Signals from Noise" (book), Prentice-Hall, Inc., 1962, p. 192.

[4] W. Davenport and W. Root, "Random Signals and Noise" (book), McGraw-Hill Book Co., 1958, p. 290.

[5] A. Papoulis, "Probability, Random Variables, and Stochastic Processes" (book), McGraw-Hill Book Co., 1965, p. 356.




USCEE Report 396


August, 1970


Optical Synchronization - Phase Locking
With Shot Noise Processes

R. Gagliardi
M. Haney

Interim Technical Report


Department of Electrical Engineering
University of Southern California
Los Angeles, California 90007


This work was sponsored by the National Aeronautics and Space
Administration, under NASA Contract NGR-05-018-104. This
grant is part of the research program initiated at NASA's Electronics
Research Center, Cambridge, Massachusetts and continued at
Goddard Space Flight Center, Greenbelt, Maryland.


Abstract 


This report presents the results of a study effort examining time synchronization in an optical communication system. Consideration is given primarily to time locking by means of a phase lock tracking loop. Since photo-detection of an intensity modulated optical beam produces a shot noise random process at its output, synchronization analysis requires a study of phase locking with shot noise processes. A statistical analysis of tracking shot noise is presented. Of particular interest is the probability density of the tracking error, which indicates the behavior of the loop during tracking, and therefore is directly related to the ability to maintain accurate synchronization. The results of the study also have application to ranging and doppler tracking using optical systems.


Table of Contents

Abstract

Chapter

1. Introduction
   1.1 The Photo-Detection Model
   1.2 Delay Locked and Phase Locked Loops

2. Error Equations For Phase Lock Loops
   2.1 Derivation of Loop Error Dynamics
   2.2 Probability Density Equations of Random Processes
   2.3 Probability Density Equations of Tracking Errors

3. Probability Density Solutions
   3.1 High Electron Density Solution
   3.2 Higher Order Approximations
   3.3 Second Order Approximations
   3.4 Third Order Approximations
   3.5 Accuracy of Truncation Solutions
   3.6 The VCO Offset Case

4. Thermal Noise and Photomultiplier Effects
   4.1 Additive Gaussian Thermal Noise
   4.2 Effect of Photomultiplication

5. Second Order Loop Analyses and General Tracking Loops
   5.1 The Two Dimensional Smoluchowski Equation
   5.2 Second Order Phase Lock Loops
   5.3 General Delay Tracking Loops
   5.4 Example: Early-Late Gate Tracking

References


Chapter 1 
INTRODUCTION 

An important operation in communication systems is the maintenance 
of synchronization between transmitter and receiver. This is generally 
accomplished by transmitting continuously over a separate channel a 
known periodic waveform, and having a subsystem of the receiver contin- 
ually track the waveform, thereby providing timing information for the 
entire receiver operation. The tracking is most typically accomplished 
by a delay locked loop which tracks the instantaneous time delay of the 
received synchronizing signal. 

In an optical communication system, the synchronizing signal is often transmitted as an intensity modulated optical (laser) beam, which is photo-detected at the receiver. The subsequent timing operation is then achieved by time locking the receiver delay locked loop to the photo-detector output. Since photo-detection of an intensity modulated optical beam produces a shot noise random process at its output, the analysis of the synchronization subsystem requires careful study of the problem of time locking with shot noise input functions. In this report we present results of a study of the statistical analysis of tracking shot noise processes. Of particular interest is the probability density of the tracking error, which indicates the behavior of the loop during the tracking operation, and therefore is directly related to the ability to maintain accurate synchronization. The results of the study also have application to ranging and doppler tracking using optical systems.


1.1 The Photo-Detection Model

The overall block diagram of the sync subsystem is shown in Figure 1. The optical beam is intensity (power) modulated with a synchronizing signal. A point source photo-detector responds to the received optical radiation by producing the output shot noise process [6, 7]

$$x(t) = \sum_{m=1}^{N(0,t)} e\, h(t - t_m) \tag{1-1}$$

where $e$ is the electron charge, $h(t)$ is the photo-electron waveshape in the photo-detector, $t_m$ are the random location times of each photo-electron, and $N(0, t)$ is the number of photo-electrons occurring during the time interval $(0, t)$. The random process $N(0, t)$ is called the counting process of the shot noise and has a mean value given by [2, 3, 4]

$$\bar{N} = \int_0^t n(y)\, dy \tag{1-2}$$

where

$n(t) = \gamma P(t)$ = intensity of the counting process, or average rate of photo-electron occurrences.

$P(t)$ = instantaneous power in the received optical field.

$\gamma$ = proportionality constant dependent upon the optical carrier frequency, Planck's constant, and the detector efficiency.

Note that the average rate of photo-electron occurrences is proportional to $P(t)$, the power modulation on the optical beam. This means that in the case of optical synchronization, the intensity process $n(t)$ in (1-2) is directly proportional to the synchronizing signal that power modulates the optical beam.

Figure 1. (Block diagram of the sync subsystem. Block labels: optical
signal (intensity modulated), background radiation, photo-detector,
carrier signal (phase modulated), phase-locked loop.)

When the bandwidth of the photo-detector is large relative to
the bandwidth of the intensity n(t), the electron functions in (1-1) can be
considered as delta functions. In addition, N(0,t) becomes a Poisson
counting process [5, 6, 8], and the probability of j photo-electrons
occurring in an interval (0,t) is given by

    Prob[N(0,t) = j] = \frac{\bar{N}^{\,j}}{j!}\, e^{-\bar{N}}          (1-3)


For a shot noise process governed by Poisson counting, the random location
times are independent, and have the probability density [5, 9, 10]

    p(t_m) = \frac{n(t_m)}{\bar{N}}                                    (1-4)

where \bar{N} is the average of N(0,t) in (1-3) and is given in (1-2). Thus,
the intensity process n(t), in addition to specifying the average rate of
electron occurrences, also defines the probability density of location
times of the electrons. Using (1-3) and (1-4) it can be shown [5, 6]
that the mean of the shot noise x(t) in (1-1) is

    [mean x(t)] = \int_0^t \delta(t - y)\, n(y)\, dy = n(t)             (1-5)

for wideband detectors. Hence, the mean of the photo-detector output 
in Figure 1 corresponds to the synchronizing signal used at the transmitter. 
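
The Poisson model of (1-2) and (1-3) can be checked with a short simulation. The sketch below is illustrative only: it assumes a sinusoidal intensity with hypothetical values of A, b and modulation frequency, and it draws the location times by the standard thinning method, which is not a procedure described in this report. For a Poisson counting process the sample mean and sample variance of the counts should both be close to the integral in (1-2).

```python
import math
import random

def simulate_arrivals(intensity, n_max, t_end, rng):
    """Location times of a Poisson process with rate intensity(t) on (0, t_end).

    Uses thinning: candidates arrive at the constant rate n_max (which must
    bound intensity(t) from above) and each is kept with probability
    intensity(t) / n_max.
    """
    times = []
    t = 0.0
    while True:
        t += rng.expovariate(n_max)
        if t >= t_end:
            return times
        if rng.random() * n_max <= intensity(t):
            times.append(t)

def mean_count(intensity, t_end, steps=10000):
    """Equation (1-2): integral of the intensity over (0, t_end), midpoint rule."""
    dt = t_end / steps
    return sum(intensity((i + 0.5) * dt) for i in range(steps)) * dt

# Hypothetical parameters, chosen only for illustration.
A, b, w = 200.0, 0.8, 2.0 * math.pi * 5.0

def n_s(t):
    """Assumed intensity: average rate A with sinusoidal modulation of index b."""
    return A * (1.0 + b * math.sin(w * t))

rng = random.Random(1)
t_end = 1.0
counts = [len(simulate_arrivals(n_s, A * (1.0 + b), t_end, rng))
          for _ in range(400)]
sample_mean = sum(counts) / len(counts)
sample_var = sum((c - sample_mean) ** 2 for c in counts) / (len(counts) - 1)
n_bar = mean_count(n_s, t_end)
print("N-bar from (1-2):", n_bar)
print("sample mean, sample variance of N(0,t):", sample_mean, sample_var)
```

The agreement between the sample variance and the mean is the Poisson signature; it does not depend on the particular intensity chosen here.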




1.2 Delay Locked and Phase Locked Loops


A delay locked loop is a feedback tracking system used to time 
lock a locally generated periodic signal to the received periodic 
synchronizing signal. During each period, the two signals are time 
compared, and differences in timing generate error voltages that are 
fed back to control the timing of the local signal generator. The choice
of signals at the transmitter and receiver determines the sensitivity
of the error voltage to the timing difference. When the two signals 
are exactly in step during each period, the error voltage is zero, and 
the local signal remains time synchronized with the received sync 
signal. When this occurs, the local signal generator is producing a 
clean, time locked signal that can be used for timing in the remainder 
of the receiver. Instantaneous error voltages due to input noise 
represent random timing errors between the two signals, and therefore 
appear as synchronization errors in the receiver operation. 

When the synchronizing and local signal are taken as sinusoids,
the delay locked loop is called a phase lock loop [1] (since timing errors
can be directly related to phase errors in the sinusoids). In phase lock
loops, the signal generator is simply a voltage controlled oscillator
(VCO), and the timing difference is produced in a filtered frequency
mixer, as shown in Figure 2. The phase variation on the synchronizing
sinusoid is then the phase signal that is to be tracked by the loop.
For example, if the synchronizing signal were taken as sin[ω_s t + θ_1(t)],
then the loop must generate an error voltage that drives the local VCO
in accordance with θ_1(t).

The loop filter in Figure 2 smooths the error voltage for control
of the VCO. The complexity of the loop, and of the associated analysis,
is determined by the type of filtering used. For a first order loop, the
filter is removed and the mixer error signal feeds the VCO directly.
A second order loop is produced if the loop filter effectively produces
an integration. Higher order loops are generated by introducing more
filter integration.

Figure 2. (Block diagram of the phase lock loop. Block labels: input x(t),
mixer, linear filter f(t), voltage controlled oscillator (VCO).)

The loop mixer simply "beats" together the input and VCO 
sinusoid. Since the mixer is inherently bandlimited, only baseband 
frequencies are produced at the mixer output, while harmonics of the 
VCO center frequency are eliminated. 

The error voltage in a phase lock loop is directly related to the 
phase difference between the VCO and the loop input signal at each instant 
of time. Hence, analytical measures of loop performance can be obtained 
through derivation of the loop error equations. Though these equations 
are generally nonlinear, the response of the loop to a "clean" synchronizing 
signal can usually be determined using basic nonlinear feedback analysis. 
Typically, the loop "pulls into" lock and the steady state loop error is driven 
to zero, or else the system is unstable and the loop "falls out" of lock. 

On the other hand, when the loop input is stochastic, the loop error responds 
in a random manner. In this case one can only describe the error statis- 
tically by its probability density. The derivation of this density, which 
is generally non-stationary, is complicated by the non-linearity of the loop.
Often, we resort to a steady state density as an indication of the statistical 
loop behavior. The steady state variance of the loop error is then a 
direct indication of the phase error caused by the randomness of the 
input. 

In the past [1, 11] the above analytical procedures have been
extensively applied to the case where the input randomness is due to
additive Gaussian noise; i.e., the loop input is composed of the sum of
a clean synchronizing signal plus additive Gaussian noise. However,
in the optical model of Figure 1, the randomness at the loop input is due
to the shot noise nature of the photo-detector output. The remainder
of this report is devoted to an investigation of the loop phase error
when the phase lock loop in Figure 2 is forced by the input shot noise
process in (1-1).




Chapter 2 


ERROR EQUATIONS FOR PHASE LOCK LOOPS 

In this chapter we analytically investigate the ability of a phase
lock loop to lock to a synchronizing signal that has been optically
transmitted and photo-detected. Mathematically, the basic problem
is that of determining the behavior of a phase lock loop when its input 
is a shot noise process having the synchronizing signal as its intensity 
process. In the following section, we derive the dynamical equations 
that describe the evolution of the phase error for such a system. 

2.1 Derivation of Loop Error Dynamics

Consider the system shown in Figure 2 where the loop
input function is the shot noise process at the wideband photo-detector
output, given by (1-1):

    x(t) = \sum_{m=1}^{N(0,t)} e\, \delta(t - t_m) .                    (2-1)

Here, e is the electron charge, {t_m} are the random location times,
δ(t) the electron functions and N(0,t) is the shot noise counting process
having intensity

    n_s(t) = A \{ 1 + b \sin[\omega_s t + \theta_1(t)] \} .             (2-2)

The above is proportional to the transmitted intensity modulation and
represents the synchronizing signal. In (2-2), ω_s is the synchronizing
frequency, b is the modulation index, θ_1(t) is the phase (time delay)
variation on the synchronizing signal that is to be instantaneously tracked
by the loop and A is the average value of n_s(t). Recall from (1-2) that
n_s(t) can equivalently be interpreted as the rate of electron occurrences
in the photo-detector, so that A represents the average number of
electrons produced per sec.

The VCO output in Figure 2 is represented by

    VCO output = k_1 \cos[\omega_0 t + \theta_2(t)]                     (2-3)

where k_1 is the VCO gain, ω_0 is the VCO rest frequency, and θ_2(t)
its phase variation. The loop phase error is defined as the phase
difference between the synchronizing signal phase and the loop VCO
phase, and therefore is

    \phi(t) = [\omega_s t + \theta_1(t)] - [\omega_0 t + \theta_2(t)]
            = (\omega_s - \omega_0) t + \theta_1(t) - \theta_2(t) .     (2-4)

The loop mixer output is then

    e_m(t) = x(t) [VCO output]
           = k_1 \cos[\omega_0 t + \theta_2(t)]
             \Big[ \sum_{m=1}^{N(0,t)} e\, \delta(t - t_m) \Big]        (2-5)

and the loop filter output is

    e_f(t) = \int_0^t e_m(\tau)\, f(t - \tau)\, d\tau .                 (2-6)

The VCO output phase responds to the VCO input control voltage e_f(t)
through the linear relation

    \frac{d\theta_2(t)}{dt} = k_2\, e_f(t)                              (2-7)

with k_2 a constant of proportionality. From (2-4) we have, upon
differentiating,

    \frac{d\phi(t)}{dt} = (\omega_s - \omega_0) + \frac{d\theta_1}{dt} - \frac{d\theta_2}{dt}
                        = (\omega_s - \omega_0) + \frac{d\theta_1}{dt} - k_2\, e_f(t) .   (2-8)

The term (ω_s - ω_0) is the difference between the input synchronizing
frequency and the VCO rest frequency and is called the frequency
"offset" of the loop. Substitution from (2-6) then yields


    \frac{d\phi}{dt} = (\omega_s - \omega_0) + \frac{d\theta_1}{dt}
        - ek \int_0^t f(t - \tau) \cos[\omega_s \tau + \theta_1(\tau) - \phi(\tau)]
          \cdot \sum_{m=1}^{N(0,\tau)} \delta(\tau - t_m)\, d\tau       (2-9)

where k = k_1 k_2, and can be interpreted as the total gain around the
loop. Equation (2-9) is then the stochastic integro-differential equation
that describes the behavior of the loop phase error in terms of the
input signal and loop parameters. Note that it is a non-linear equation
with φ(t) appearing on both sides of the equation. The input shot noise
and the phase variation of the transmitted synchronizing signal play
the role of "forcing" functions in the generation of the error process.
Since the input shot noise contains random parameters, the solution
for φ(t) necessarily evolves as a stochastic process.

We ultimately will be interested in the statistical properties of
the phase error. We may however note that a simple expression for
the mean of φ(t) in (2-9) can be generated, which may be useful in signal
design. If we average both sides of (2-9) and interchange averaging
and differentiation on the left, we see that

    \frac{d\bar{\phi}(t)}{dt} = (\omega_s - \omega_0) + \frac{d\theta_1}{dt}
        - ek \int_0^t f(t - \tau)\,
          E\Big\{ \cos[\omega_s \tau + \theta_1(\tau) - \phi(\tau)]
          \sum_{m=1}^{N} \delta(\tau - t_m) \Big\}\, d\tau              (2-10)

where \bar{\phi}(t) is the mean of φ(t). The averaging in the integrand can
be carried out by using the conditional expectations:

    E\{\cdot\} = E_\phi \big\{ E_{t_m | \phi} [\,\cdot\,] \big\} .       (2-11)

The inner expectation involves only the average of the shot noise,
which is given in (1-5) as

    E\Big[ \sum_{m=1}^{N} \delta(t - t_m) \Big] = n_s(t) .              (2-12)

Substitution into (2-11) allows us to rewrite the braces in (2-10) as

    E_\phi\{ \sin[(\omega_0 - \omega_s) t + \phi] \}
        + E_\phi[\text{terms at } (\omega_0 + \omega_s)] .              (2-13)

The loop filtering in (2-9) eliminates the sum frequency term. Hence,
(2-10) becomes

    \frac{d\bar{\phi}(t)}{dt} = \Big[ (\omega_s - \omega_0) + \frac{d\theta_1}{dt} \Big]
        - ek \int_0^t f(t - \tau)\,
          E_\phi\{ \sin[(\omega_0 - \omega_s)\tau + \phi(\tau)] \}\, d\tau .   (2-14)

The above is interesting in that it shows that if the loop is tracking
frequency and phase fairly accurately (i.e., ω_s = ω_0 and sin(φ) ≈ φ) then
(2-14) is approximately

    \frac{d\bar{\phi}}{dt} = \frac{d\theta_1}{dt}
        - ek \int_0^t f(t - \tau)\, \bar{\phi}(\tau)\, d\tau .           (2-15)


This equation has the form of the deterministic tracking error produced
in the linear feedback loop shown in Figure 3, when forced by the input
θ_1(t). Note that the equivalent linear loop replaces the VCO by an
integrator, the mixer by a subtractor, and retains the same loop filter.
Hence, the loop error function in (2-9) has a mean value such that when
the loop is tracking well [i.e., |φ(t)| << 1], the mean varies in time
according to the error function of the linear system in Figure 3. The
latter system can therefore be used to design loop filters and compute
mean error performance.

For a complete statistical analysis, however, we must return
to (2-9) for study. The complexity of the error process φ(t) is exhibited
even if we consider a simplified special case. For example, consider
a first order loop in which the loop filter is removed. [This effectively
replaces f(t) by a delta function in (2-6).] In this case, (2-9) becomes

    \frac{d\phi}{dt} = (\omega_s - \omega_0) + \frac{d\theta_1}{dt}
        - ek \cos[\omega_s t + \theta_1(t) - \phi(t)]
          \cdot \sum_{m=1}^{N(0,t)} \delta(t - t_m) .                   (2-16)

Though simplified, (2-16) is still a non-linear differential equation
involving the random loop error process φ(t). By integrating both sides
we note

    \phi(t) = [(\omega_s - \omega_0) t + \theta_1(t)]
        - ek \sum_{m=1}^{N(0,t)} \cos[\omega_s t_m + \theta_1(t_m) - \phi(t_m)] .   (2-17)


The second term represents a summation of random "jumps", the height
of the jumps dependent upon φ(t) itself. This identifies the process
φ(t) in (2-16) as a discontinuous, or "jump", process in which the
number of jumps is governed by the counting process N(0,t). Therefore,
even for this specialized case, the complexity of the error process is
apparent.

Figure 3. (Equivalent linear feedback loop.)
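
The jump behavior of (2-17) can be made concrete with a small simulation. The following is only a sketch under stated assumptions: zero frequency offset, a constant phase θ_1, hypothetical values for A, b, ek and ω_s, location times drawn by thinning, and each jump height evaluated with the phase error just before the jump. None of these choices is prescribed by the report.

```python
import math
import random

def simulate_phase_error(A, b, ek, w_s, theta1, t_end, rng, phi0=0.0):
    """Return [(t_m, phi just after the jump at t_m)] for the first order loop.

    Assumptions (not from the report): zero frequency offset, constant
    theta1, Poisson arrivals with intensity A*(1 + b*sin(w_s*t + theta1))
    drawn by thinning, and jump heights -ek*cos(w_s*t_m + theta1 - phi)
    evaluated with the value of phi just before the jump, as in (2-17).
    """
    n_max = A * (1.0 + b)              # upper bound on the intensity
    t, phi = 0.0, phi0
    path = []
    while True:
        t += rng.expovariate(n_max)
        if t >= t_end:
            return path
        arg = w_s * t + theta1
        if rng.random() * n_max <= A * (1.0 + b * math.sin(arg)):
            phi -= ek * math.cos(arg - phi)   # one random "jump"
            path.append((t, phi))

# Hypothetical operating point: 2b/ek = 40.
rng = random.Random(7)
path = simulate_phase_error(A=2000.0, b=1.0, ek=0.05,
                            w_s=2.0 * math.pi * 1000.0, theta1=0.3,
                            t_end=2.0, rng=rng)
settled = [phi for t, phi in path if t > 1.0]
mean_phi = sum(settled) / len(settled)
rms_phi = math.sqrt(sum(p * p for p in settled) / len(settled))
print("jumps:", len(path), "mean:", mean_phi, "rms:", rms_phi)
```

Every jump has magnitude at most ek, and the phase error wanders about zero rather than settling, which is the stochastic behavior the text describes.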

In the following section we derive an equation involving the 
probability density of a general random process. Subsequently, we shall 
apply the result to the error process generated in (2-9). 

2.2 Probability Density Equations of Random Processes 

Let φ(t) be a scalar random process, and let p(φ_1, t_1) represent
the probability density function (pdf) of the process at time t_1 in the
variable φ_1. Similarly, denote p(φ_1, t_1 | φ_2, t_2) as the conditional
pdf of φ(t) at time t_1, given that φ(t_2) = φ_2 at time t_2. The pdf is
then always related to the conditional pdf by

    p(\phi_1, t_1) = \int_{-\infty}^{\infty} p(\phi_1, t_1 | \phi_2, t_2)\,
                     p(\phi_2, t_2)\, d\phi_2 .                         (2-18)

Note that the conditional density can be interpreted as a transitional
density in the sense that it "converts" the pdf at time t_2 to its new
density at time t_1. When t_1 > t_2, this transitional density essentially
indicates the manner in which the pdf propagates in time.

Equation (2-18) can be rewritten in a different form for
convenient interpretation and application. Define the conditional
characteristic function of the random increment Δφ = φ_1 - φ_2 as

    C_\Delta(\omega) = \int_{-\infty}^{\infty} e^{j\omega(\phi_1 - \phi_2)}\,
                       p(\phi_1, t_1 | \phi_2, t_2)\, d\phi_1 .          (2-19)

By inverse Fourier transform

    p(\phi_1, t_1 | \phi_2, t_2) = \frac{1}{2\pi} \int_{-\infty}^{\infty}
        e^{-j\omega(\phi_1 - \phi_2)}\, C_\Delta(\omega)\, d\omega .     (2-20)


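Substitution of (2-20) into (2-18) then yields

    p(\phi_1, t_1) = \frac{1}{2\pi} \int_{-\infty}^{\infty} p(\phi_2, t_2)\, d\phi_2
        \int_{-\infty}^{\infty} e^{-j\omega(\phi_1 - \phi_2)}\, C_\Delta(\omega)\, d\omega .   (2-21)

Now it is well known that the characteristic function can be expanded
into moments as

    C_\Delta(\omega) = 1 + \sum_{i=1}^{\infty} \frac{(j\omega)^i}{i!}\, m_i(\Delta\phi)   (2-22)

where

    m_i(\Delta\phi) = E[ (\phi_1 - \phi_2)^i \mid \phi_2 ]              (2-23)

is the i-th conditional moment of Δφ given φ(t_2) = φ_2. [Alternatively,
m_i(Δφ) are the moments of the conditional pdf in (2-20).] It follows
that, with m_0(Δφ) = 1,

    p(\phi_1, t_1) = \sum_{i=0}^{\infty} \frac{1}{2\pi\, i!}
        \int_{-\infty}^{\infty} p(\phi_2, t_2)\, d\phi_2
        \int_{-\infty}^{\infty} m_i(\Delta\phi)\,
        e^{-j\omega(\phi_1 - \phi_2)} (j\omega)^i\, d\omega .           (2-24)

But

    \frac{1}{2\pi} \int_{-\infty}^{\infty} e^{-j\omega(\phi_1 - \phi_2)} (j\omega)^i\, d\omega
        = \Big( -\frac{\partial}{\partial\phi_1} \Big)^i
          \frac{1}{2\pi} \int_{-\infty}^{\infty} e^{-j\omega(\phi_1 - \phi_2)}\, d\omega
        = \Big( -\frac{\partial}{\partial\phi_1} \Big)^i \delta(\phi_1 - \phi_2)   (2-25)

and (2-24) becomes

    p(\phi_1, t_1) = \sum_{i=0}^{\infty} \frac{1}{i!}
        \Big( -\frac{\partial}{\partial\phi_1} \Big)^i
        \int_{-\infty}^{\infty} m_i(\Delta\phi)\, p(\phi_2, t_2)\,
        \delta(\phi_1 - \phi_2)\, d\phi_2
      = p(\phi_1, t_2) + \sum_{i=1}^{\infty} \frac{1}{i!}
        \Big( -\frac{\partial}{\partial\phi_1} \Big)^i
        [ m_i(\Delta\phi)\, p(\phi_1, t_2) ] .                          (2-26)

The first term is the pdf at time t = t_2, and the summation represents
the increment in this latter pdf to produce the pdf at t = t_1. If we set
t_2 = t and t_1 = t + Δt, then (2-26) becomes

    p(\phi_1, t + \Delta t) - p(\phi_1, t) = \sum_{i=1}^{\infty} \frac{1}{i!}
        \Big( -\frac{\partial}{\partial\phi_1} \Big)^i
        [ m_i(\Delta\phi)\, p(\phi_1, t) ] .                            (2-27)

Dividing by Δt and passing to the limit as Δt → 0 we obtain

    \frac{\partial p(\phi, t)}{\partial t} = \sum_{i=1}^{\infty} \frac{1}{i!}
        \Big( -\frac{\partial}{\partial\phi} \Big)^i
        [ K_i(\phi)\, p(\phi, t) ] ,                                    (2-28)

where

    K_i(\phi) = \lim_{\Delta t \to 0} \frac{E[ (\Delta\phi)^i \mid \phi ]}{\Delta t} .   (2-29)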


Equation (2-28) is called the stochastic kinetic equation [17], or the
Smoluchowski-Kolmogorov equation [16]. When the coefficients K_i(φ)
exist, this equation provides a relation that must be satisfied by the
pdf of the process φ(t). Note that the equation is a partial differential
equation with variable coefficients, and involves all orders of derivatives.
The remarkable point is that no continuity conditions on φ(t) were
required, so that the equation is valid whether φ(t) is continuous or not.
In essence, the integral equation in (2-18) has been replaced by the
differential equation in (2-28). Furthermore, while one needs the
complete conditional pdf to carry out (2-18), only the moments of this
density are needed to derive (2-28).
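
As a simple illustration of how a discontinuous process populates every coefficient of (2-28), consider a worked example that is not taken from the report: a process whose only motion is a jump of fixed height a at the events of a Poisson process with constant rate λ (both a and λ are hypothetical symbols introduced here).

```latex
% Increment over (t, t + \Delta t): \Delta\phi = a\,N(\Delta t),
% with N(\Delta t) Poisson of mean \lambda\,\Delta t.
\begin{align*}
E[(\Delta\phi)^i] &= a^i\,E[N^i(\Delta t)]
                   = a^i\,\bigl(\lambda\,\Delta t + O(\Delta t^2)\bigr), \\
K_i &= \lim_{\Delta t \to 0} \frac{E[(\Delta\phi)^i]}{\Delta t}
     = \lambda\,a^i , \qquad i = 1, 2, 3, \dots \\
\frac{\partial p(\phi,t)}{\partial t}
    &= \lambda \sum_{i=1}^{\infty} \frac{(-a)^i}{i!}\,
       \frac{\partial^i p(\phi,t)}{\partial \phi^i}
     = \lambda\,\bigl[\,p(\phi - a,\,t) - p(\phi,\,t)\,\bigr].
\end{align*}
```

The last step is the Taylor series of p(φ - a, t) about φ. No coefficient vanishes, so the series cannot be truncated exactly, and the infinite order equation is equivalent to a simple difference equation describing probability arriving from φ - a.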


The principal usefulness of the Smoluchowski equation occurs
when only the first few coefficients K_i(φ) are non-zero. In particular,
if K_i(φ) = 0, i ≥ 3, the resulting equation is called the Fokker-Planck
equation, and has been extensively studied [1]. The Fokker-Planck
equation will arise whenever the random process φ(t) is continuous,
while discontinuous processes generate all the coefficients in (2-28) [17].
We would expect this latter condition to be true for our process φ(t),
based upon our earlier discussion of the apparent jump nature of the
error function. Equation (2-28) is a partial differential equation
of the type

    \frac{\partial p(\phi, t)}{\partial t} = L_\phi[ p(\phi, t) ]       (2-30)

where L_φ is a differential operator in φ. The usual method for
solving this type of equation is by separation of variables. In this method
it is assumed that

    p(\phi, t) = K(t)\, p(\phi)                                         (2-31)

and a solution is desired that satisfies Equation (2-30) with the
appropriate initial conditions. Substitution into Equation (2-30) yields

    \frac{1}{K(t)} \frac{dK(t)}{dt} = \frac{1}{p(\phi)} L_\phi[ p(\phi) ] .   (2-32)

Since the left side depends only on t, and the right side only on φ,
they can be equal only if they equal a constant. Thus

    \frac{dK(t)}{dt} = -c\, K(t) , \qquad L_\phi[ p(\phi) ] = -c\, p(\phi)   (2-33)

for some c if a solution is to be found by this method. Furthermore,
if {c_i} is a set of values of c which satisfy the above, then p(φ,t)
must be of the form

    p(\phi, t) = \sum_i B_i(\phi)\, e^{-c_i t}                          (2-34)

where the {B_i(φ)} are determined by appropriate initial conditions. Since
each term of the sum approaches zero as t goes to infinity for all c_i
greater than zero, the steady state solution, p(φ) (defined as the
limiting form of p(φ,t) as t → ∞), must be due to the value of c_i = 0.
Therefore, from (2-33), the steady state solution satisfies

    L_\phi[ p(\phi) ] = 0 .                                             (2-35)

Thus, the steady state solution to (2-35) (if one exists) is the solution
to a differential equation obtained by setting the right hand side of
(2-30) equal to zero and replacing p(φ,t) by p(φ).


2.3 Probability Density Equations of Loop Tracking Errors

It has been shown that a general random process has a probability
density which satisfies the Kolmogorov partial differential equation. We have
seen that this equation may, however, involve an infinite number of
derivative terms. In this section we would like to derive the corresponding
pdf equation for the phase error process of a tracking loop, governed by
the dynamical equation in (2-9). To accomplish this, we must calculate
the sequence of moment coefficients K_i(φ) given by (2-29). This in turn
requires determinations of the phase increment of φ(t) during the
interval (t, t + Δt).


Consider a first order phase lock loop tracking a synchronizing
signal with a constant delay, following wideband photo-detection. The
phase error φ(t) then satisfies the differential equation (2-16), and has
the form:

    \frac{d\phi}{dt} = -ek \cos[\omega_s t + \theta_1 - \phi(t)]
        \cdot \sum_{m=1}^{N(0,t)} \delta(t - t_m)                       (2-36)

where θ_1 is the constant phase delay. Note that the forcing function
in (2-15) is zero, so that the steady state mean error is zero. The phase
variation Δφ is obtained by integrating dφ from t to t + Δt. Thus, from
(2-16)

    \Delta\phi = \int_t^{t+\Delta t} d\phi
      = -ek \int_t^{t+\Delta t} \cos[\omega_s t + \theta_1 - \phi(t)]
        \cdot \sum_{m=1}^{N(0,t)} \delta(t - t_m)\, dt
      = -ek \sum_{m=1}^{N(\Delta t)} \cos[\omega_s t_m + \theta_1 - \phi(t_m)]   (2-37)

where N(Δt) is the number of electron occurrences in the interval
(t, t + Δt). The above expresses the increment of the phase variation
during (t, t + Δt). Note that this variation is also a "jump process",
having randomly occurring "jumps" of random heights, and that the
argument of the cosine function depends upon the process φ(t) itself
(which emphasizes the non-linearity of the loop dynamics).

Now, from Equation (2-29)

    K_n(\phi) = \lim_{\Delta t \to 0} \frac{1}{\Delta t}\,
                E[ (\Delta\phi)^n \mid \phi ]
              = \lim_{\Delta t \to 0} \frac{(-ek)^n}{\Delta t}\,
                E_{N, t_m | \phi} \Big[ \Big( \sum_{m=1}^{N(\Delta t)}
                \cos\theta'(t_m) \Big)^n \Big]                          (2-38)

where θ' = [ω_s t + θ_1 - φ] and the expectation is conditioned on φ. The
quantity in brackets becomes

    \sum_{m_1=1}^{N(\Delta t)} \sum_{m_2=1}^{N(\Delta t)} \cdots
    \sum_{m_n=1}^{N(\Delta t)}
    \cos\theta'(t_{m_1}) \cos\theta'(t_{m_2}) \cdots \cos\theta'(t_{m_n})

which is

    \sum_{m=1}^{N(\Delta t)} \cos^n\theta'(t_m)
    + \sum_{m_1=1}^{N(\Delta t)} \cdots \sum_{m_n=1}^{N(\Delta t)}
      \cos\theta'(t_{m_1}) \cdots \cos\theta'(t_{m_n}) ,
      \qquad m_1 \ne m_2 \ne \cdots \ne m_n .

The expectation over just the second term above is

    E_N \Big\{ \sum_{m_1=1}^{N(\Delta t)} \cdots \sum_{m_n=1}^{N(\Delta t)}
    E_{t_m | N, \phi} [ \cos\theta'(t_{m_1}) \cdots \cos\theta'(t_{m_n}) ] \Big\} ,
    \qquad m_1 \ne m_2 \ne \cdots \ne m_n ,

where E_{t_m | N, φ} is a conditional expectation given N and φ. The
expectation over N(Δt) simply becomes the average of the counting
process over (t, t + Δt). Since this expression does not involve those
terms where m_1 = m_2 = ... = m_n, the above expression becomes

    (\bar{N}^n - \bar{N})\, E_{(t_{m_1}, t_{m_2}, \ldots, t_{m_n})}
    [ \cos\theta'(t_{m_1}) \cos\theta'(t_{m_2}) \cdots \cos\theta'(t_{m_n}) ]

where from (1-2)

    \bar{N} = \int_t^{t+\Delta t} n(\tau)\, d\tau .

The conditional expectation of the term in the brackets requires
the n-dimensional joint probability density of the n random variables
{t_{m_i}}. For Poisson shot noise processes this is obtained from (1-4)
as

    p(t_{m_1}, t_{m_2}, \ldots, t_{m_n}) = \frac{1}{\bar{N}^n}
        \prod_{i=1}^{n} n(t_{m_i}) .

Therefore, the conditional expectation over the {t_{m_i}} is

    \int_t^{t+\Delta t} \cdots \int_t^{t+\Delta t}
    [ \cos\theta'(t_{m_1}) \cdots \cos\theta'(t_{m_n}) ]
    [ n(t_{m_1}) \cdots n(t_{m_n}) ]\, dt_{m_1} \cdots dt_{m_n}

for t ≤ t_{m_1} ≤ ... ≤ t_{m_n} ≤ (t + Δt). As we take the limit as Δt goes to
zero this expression behaves as

    [ \cos\theta'(t_{m_1}) \cdots \cos\theta'(t_{m_n}) ]
    [ n(t_{m_1}) \cdots n(t_{m_n}) ]\, (\Delta t)^n .

Therefore, taking the limit as Δt goes to zero the above expression,
after division by Δt, behaves as (Δt)^{n-1} which goes to zero. Hence the
second term resulting from Equation (2-38) is zero and

    K_n(\phi) = \lim_{\Delta t \to 0} \frac{(-ek)^n}{\Delta t}\,
                E_{N, t_m | \phi} \Big[ \sum_{m=1}^{N(\Delta t)}
                \cos^n\theta'(t_m) \Big]
              = \lim_{\Delta t \to 0} \frac{(-ek)^n}{\Delta t}\,
                \bar{N}\, E_{t_m | \phi} [ \cos^n\theta'(t_m) ] .

The expectation of the bracketed term is

    \int_t^{t+\Delta t} \cos^n\theta'(t_m)\, \frac{n(t_m)}{\bar{N}}\, dt_m

and

    K_n(\phi) = (-ek)^n \cos^n\theta'(t)\, n(t) .                       (2-39)
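
The key step above, that only the "diagonal" terms survive the limit, can be checked numerically in a simplified setting. The sketch below is not from the report: it assumes a constant intensity and a constant jump height (hypothetical values `rate` and `height`), so that the increment is height times a Poisson count and the expected limit is rate times height to the nth power.

```python
import math

def poisson_moment(mu, n, terms=60):
    """E[N**n] for N Poisson with mean mu, by direct summation of the pmf."""
    total = 0.0
    pmf = math.exp(-mu)            # P(N = 0)
    for j in range(terms):
        total += (j ** n) * pmf
        pmf *= mu / (j + 1)        # P(N = j + 1) from P(N = j)
    return total

def moment_coefficient(rate, height, n, dt):
    """Finite-dt estimate of K_n = E[(height * N)**n] / dt, N Poisson(rate*dt)."""
    return (height ** n) * poisson_moment(rate * dt, n) / dt

# Hypothetical values, for illustration only.
rate, height = 100.0, -0.1
for dt in (1e-3, 1e-4, 1e-5):
    estimates = [moment_coefficient(rate, height, n, dt) for n in (1, 2, 3)]
    print(dt, estimates)
print("limits:", [rate * height ** n for n in (1, 2, 3)])
```

As dt shrinks, the cross terms (which scale as higher powers of rate times dt) drop out and each estimate approaches rate times height to the nth power, mirroring the form of (2-39).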


This equation represents the general nth conditional moment of the
increment of the phase error. Note that it is in terms of the feedback
signal and the intensity modulation, n(t). Since Equation (2-39) is
basically a product of sinusoids, K_n(φ) will contain sine waves at the
"beat" frequencies. Remembering that terms involving frequencies
of nω_s, n ≥ 1, are eliminated by the mixer, the general expressions
for K_n(φ) become

    K_n(\phi) = \begin{cases}
        -C_n (ek)^n A b \sin\phi , & n \text{ odd} \\
        C_{n-1} (ek)^n A ,         & n \text{ even}
    \end{cases}                                                         (2-40)

where

    C_n = \prod_{i=1,\ i\ \mathrm{odd}}^{n} \frac{i}{i+1} ,
          \qquad n \text{ odd} .                                        (2-41)
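
The coefficients in (2-40) and (2-41) can be checked by averaging the product of sinusoids in (2-39) over one cycle of the synchronizing frequency. The sketch below is an illustrative check, not part of the original analysis; it assumes the intensity form (2-2), writes x for ω_s t + θ_1, and uses arbitrary test values of φ and b.

```python
import math
from fractions import Fraction

def c_coeff(n):
    """C_n of (2-41): product of i/(i+1) over odd i from 1 to n (n odd)."""
    if n < 1 or n % 2 == 0:
        raise ValueError("C_n is defined here for odd n >= 1")
    result = Fraction(1)
    for i in range(1, n + 1, 2):
        result *= Fraction(i, i + 1)
    return result

def time_average(n, phi, b, steps=20000):
    """Average over one period in x of cos(x - phi)**n * (1 + b*sin(x)).

    Multiplying by (-ek)**n * A gives the time averaged K_n of (2-39).
    """
    dx = 2.0 * math.pi / steps
    total = 0.0
    for i in range(steps):
        x = i * dx
        total += math.cos(x - phi) ** n * (1.0 + b * math.sin(x))
    return total / steps

phi, b = 0.7, 0.9      # arbitrary test values
for n in (1, 3, 5):
    print(n, time_average(n, phi, b), b * float(c_coeff(n)) * math.sin(phi))
for n in (2, 4, 6):
    print(n, time_average(n, phi, b), float(c_coeff(n - 1)))
```

For odd n the average is b C_n sin φ and for even n it is C_{n-1}, so multiplying by (-ek)^n A reproduces both cases of (2-40), including the sign.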


The series form of the pdf equation now becomes

    \frac{\partial p(\phi, t)}{\partial t}
      = A b \sum_{n=1,\ n\ \mathrm{odd}}^{\infty} \frac{C_n (ek)^n}{n!}\,
        \frac{\partial^n}{\partial\phi^n} [ \sin\phi\; p(\phi, t) ]
      + A \sum_{n=2,\ n\ \mathrm{even}}^{\infty} \frac{C_{n-1} (ek)^n}{n!}\,
        \frac{\partial^n p(\phi, t)}{\partial\phi^n} .                  (2-42)

Note: Mathematically, we are implying that the expectation operation in
Equation (2-29) contains an additional time averaging operation,
caused by the filtering effects of the mixer. Thus, to be rigorous,
a time averaged version of K_n(φ) is being computed.


The solution to this equation is the pdf, p(φ,t), of the phase error,
φ, at each instant of time, t. Note that the equation is an infinite
order partial differential equation with coefficients that are functions
of the variable φ. The infinite number of derivative terms can be
directly attributed to the "jump" nature of the phase error process.
The steady state solution of the pdf is given by (2-35) obtained by
setting the right side of (2-42) equal to zero. Thus, with p(φ) denoting
the steady state pdf, we have

    0 = A b \sum_{n=1,\ n\ \mathrm{odd}}^{\infty} \frac{C_n (ek)^n}{n!}\,
        \frac{d^n}{d\phi^n} [ \sin\phi\; p(\phi) ]
      + A \sum_{n=2,\ n\ \mathrm{even}}^{\infty} \frac{C_{n-1} (ek)^n}{n!}\,
        \frac{d^n p(\phi)}{d\phi^n} .                                   (2-44)


The steady state pdf can be determined by solving the above total
differential equation with the appropriate initial conditions. The
equation is still, however, of infinite order and the hope of obtaining
an exact solution is somewhat ambitious. Nevertheless, there is still
usable information that may be extracted from Equation (2-44)
without a complete solution. For example, we note that the coefficients
are periodic in φ, implying that if p(φ) is a solution to (2-44) then
p(φ + 2π) is also a solution. Hence, steady state pdf solutions are
periodic with period 2π. For this reason we need only concentrate
on deriving a normalized solution over a single period, and φ will
therefore be constrained to (-π, π) in the subsequent analysis. For
convenience, we can rewrite (2-44) in a slightly different form by
first dividing through by the coefficient for n = 2. This yields

    0 = \alpha \frac{d}{d\phi} [ \sin\phi\; p(\phi) ] + \frac{d^2 p(\phi)}{d\phi^2}
      + \sum_{n=3,\ n\ \mathrm{odd}}^{\infty} \frac{4 b\, C_n}{n!}
        \Big( \frac{2b}{\alpha} \Big)^{n-2}
        \frac{d^n}{d\phi^n} [ \sin\phi\; p(\phi) ]
      + \sum_{n=4,\ n\ \mathrm{even}}^{\infty} \frac{4\, C_{n-1}}{n!}
        \Big( \frac{2b}{\alpha} \Big)^{n-2}
        \frac{d^n p(\phi)}{d\phi^n}                                     (2-45)


where α = 2b/ek. For a first order loop the gain k is directly
related to the loop noise bandwidth B_L by [1]

    B_L = \frac{e A k}{4} .                                             (2-46)

Since it is desirous to operate the loop with a given bandwidth, the
loop gain k must be adjusted to achieve this value. Hence, k = 4B_L/eA
and the α parameter in (2-45) takes the form

    \alpha = \frac{A b}{2 B_L} .                                        (2-47)


The coefficient Ab can be interpreted as the average rate of electrons
of the intensity modulation by the synchronizing signal. In this light,
α is then the average number of electrons produced in a 1/2B_L time
period, i.e., in a time period corresponding to the reciprocal of the
designed carrier bandwidth. Hence α can be considered an electron
function "density", indicating the accumulation of electron occurrences
over a fixed time period. By relating electron occurrences to photons,
the density α can also be interpreted in terms of received synchronizing
energy, or in terms of signal to noise ratios. In particular, if we
multiply numerator and denominator by e²A, then

    \alpha = b\, \frac{(eA)^2}{e^2 A\, (2 B_L)} .                       (2-48)

The term (eA)² is proportional to the average current power
in the synchronizing signal, while (e²A) is the spectral level of the shot
noise power spectrum and (e²A)2B_L is proportional to the total shot
noise power in a 2B_L bandwidth. Hence, α can also be considered
an indication of the signal-to-shot noise power ratio. As such, we
would expect performance to improve as α increases. This would
mean the modulation index b should be as large as possible for best
operation. We shall find this conjecture is true, and therefore from
here on b will be given its maximum possible value (b = 1) in (2-47).

Note that the higher order coefficients in (2-45) decrease
with increasing α. This appears to indicate a diminishing importance
of the higher derivative terms in contributing to the solution as α
increases. This conjecture will be investigated in the next chapter,
and will be shown to have both a mathematical and physical
interpretation.

One last point is worthy of comment concerning (2-45).
Note that the only parameter affecting the equation, and therefore
the solution, is α, the electron (photon) density in a 1/2B_L time
period. In particular, the synchronizing carrier frequency ω_s
in (2-2) does not appear in the solution. Hence, it is meaningless
to cite values of numbers of electrons (photons) per cycle of
synchronizing carrier frequency in discussing optical time locking. It is
only the number per cycle of loop bandwidth that is significant.
Of course, the sync frequency is important in converting
phase errors in radians to timing errors in seconds.
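
The relations (2-46) and (2-47) amount to a short design calculation. The sketch below uses hypothetical numbers (an electron rate of 10^6 per second and a 1 kHz loop bandwidth) purely as an illustration; the function names are introduced here and do not appear in the report.

```python
ELECTRON_CHARGE = 1.602e-19   # coulombs

def loop_gain(A, B_L, e=ELECTRON_CHARGE):
    """Loop gain k that gives loop noise bandwidth B_L, from (2-46): k = 4*B_L/(e*A)."""
    return 4.0 * B_L / (e * A)

def electron_density(A, b, B_L):
    """Density parameter alpha of (2-47): electrons per 1/(2*B_L) seconds times b."""
    return A * b / (2.0 * B_L)

# Hypothetical operating point, for illustration only.
A = 1.0e6      # average electron rate, electrons per second
b = 1.0        # modulation index
B_L = 1.0e3    # loop noise bandwidth, Hz

k = loop_gain(A, B_L)
alpha = electron_density(A, b, B_L)
print("k =", k)
print("alpha =", alpha)
print("alpha from 2b/(ek) =", 2.0 * b / (ELECTRON_CHARGE * k))
```

Note that the synchronizing frequency does not enter: at this operating point α is 500 whatever the value of ω_s, in line with the remark above.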




Chapter 3 

PROBABILITY DENSITY SOLUTIONS 


In Chapter 2 an infinite order differential equation was derived
for the steady state probability density of the loop phase error of a
first order tracking loop with shot noise inputs. The equation showed
that the coefficients of the resulting derivative terms in the equation
depended upon the electron function rate in the photo-detector, which
in turn depended upon the received radiation power. In this chapter
we investigate approximate solutions for the desired probability
density of the tracking error.

3.1 High Electron Density Solution

For the case where the function density α in (2-47) is extremely
high, a first approximation to the solution of Eq. (2-45) can be obtained
by dropping all terms that have powers of 1/α as coefficients. This
leads to the equation

    0 = \alpha \frac{d}{d\phi} [ \sin\phi\; p(\phi) ]
        + \frac{d^2 p(\phi)}{d\phi^2}                                   (3-1)

where p(φ) is the steady state density and α is the electron density
at the photo-detector output:

    \alpha = \frac{A}{2 B_L} .                                          (3-2)

Equation (3-1) is just the steady state form of the Fokker-Planck
equation and can easily be solved. Integrating both sides yields

    C_0 = \alpha \sin\phi\; p(\phi) + \frac{d p(\phi)}{d\phi}           (3-3)


where C_0 is an arbitrary constant. This equation can be solved over
the interval, -π ≤ φ ≤ π, with the two boundary conditions:

1) p(π) = p(-π) (periodicity)

2) \int_{-\pi}^{\pi} p(\phi)\, d\phi = 1 .

The solution is

    p(\phi) = \frac{e^{\alpha \cos\phi}}{2\pi I_0(\alpha)}              (3-4)

where I_0 is the imaginary Bessel function. Equation (3-4) is plotted
in Figure 4, for various α. Note that the probability density
approaches, for large α, a delta function at zero, while for α → 0,
it approaches a uniform density over the phase error interval.
The former case can be considered the limit of perfect tracking,
while the latter represents a completely random phase error; i.e.,
poor phase tracking. The ability to track is therefore directly related
to the value of the α parameter.
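
The density (3-4) is easy to evaluate and check numerically. The following sketch is an illustration, not part of the report: it computes I_0 from its power series, then verifies the two boundary conditions and that the density satisfies the once-integrated equation α sin φ p(φ) + dp/dφ = C_0 with C_0 = 0. The function names and the test value of α are choices made here.

```python
import math

def bessel_i0(x, terms=100):
    """Imaginary (modified) Bessel function I_0(x) from its power series."""
    term, total = 1.0, 1.0
    for k in range(1, terms):
        term *= (x / 2.0) ** 2 / (k * k)
        total += term
    return total

def steady_state_pdf(phi, alpha):
    """Equation (3-4): exp(alpha*cos(phi)) / (2*pi*I_0(alpha))."""
    return math.exp(alpha * math.cos(phi)) / (2.0 * math.pi * bessel_i0(alpha))

def normalization(alpha, steps=2000):
    """Integral of the density over (-pi, pi), midpoint rule."""
    dphi = 2.0 * math.pi / steps
    return sum(steady_state_pdf(-math.pi + (i + 0.5) * dphi, alpha)
               for i in range(steps)) * dphi

def residual(phi, alpha, h=1e-5):
    """alpha*sin(phi)*p + dp/dphi, with a central difference for the derivative."""
    slope = (steady_state_pdf(phi + h, alpha)
             - steady_state_pdf(phi - h, alpha)) / (2.0 * h)
    return alpha * math.sin(phi) * steady_state_pdf(phi, alpha) + slope

alpha = 5.0   # arbitrary test value
print("area:", normalization(alpha))
print("p(pi) - p(-pi):", steady_state_pdf(math.pi, alpha) - steady_state_pdf(-math.pi, alpha))
print("residual at phi = 0.8:", residual(0.8, alpha))
print("peak p(0):", steady_state_pdf(0.0, alpha))
```

Increasing α in this sketch sharpens the peak at zero, and letting α go to zero flattens the density toward 1/2π, matching the two limits described above.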

It is of interest to note that the solution in (3-4) is the same
solution obtained for the first order loop when driven by a sinusoidal
signal plus additive white Gaussian noise [1, 11]. Thus, the error
differential equation due to shot noise inputs becomes identical to that
due to additive input Gaussian noise as the higher order coefficients
are eliminated. In essence, this serves as an apparent justification
for the truncation of Eq. (2-45) to (3-1) for large values of α since
it has been shown [3, 5, 6] that a discrete Poisson shot noise process
approaches a continuous Gaussian process as α → ∞. Thus, for
α >> 1, the shot noise error pdf is, to a first approximation, given by
the solution for additive Gaussian noise inputs.

Figure 4. (Plot of p(φ) versus φ in radians, for various α.)

The pdf in (3-4) has zero mean and variance given by

    \sigma_\phi^2 = \frac{\pi^2}{3} + 4 \sum_{n=1}^{\infty}
        \frac{(-1)^n\, I_n(\alpha)}{n^2\, I_0(\alpha)}                  (3-5)

where I_n(α) is the nth order imaginary Bessel function. This
variance is shown as a function of α in Figure 5. As the parameter
α approaches zero the variance approaches π²/3, the variance
of a uniformly distributed random variable over the interval
(-π, π). It may be seen that the tracking variance for the steady
state pdf of the phase error is approximately proportional to 1/α
for large α. For α below 5, the variance increases rapidly,
but the range of validity of the high density solution is questionable.
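
The series (3-5) can be compared against direct numerical integration of φ² p(φ). The sketch below is illustrative only; it assumes the density (3-4), computes I_n from its power series, and truncates the sum at a hypothetical `n_max` of 200 terms, which is ample for the values of α tried here.

```python
import math

def bessel_i(n, x):
    """Imaginary (modified) Bessel function I_n(x), integer n >= 0, x >= 0."""
    half = x / 2.0
    term = 1.0
    for i in range(1, n + 1):
        term *= half / i                     # builds (x/2)**n / n!
    total = term
    k = 0
    while term > 1e-17 * total and k < 1000:
        k += 1
        term *= half * half / (k * (k + n))  # next term of the power series
        total += term
    return total

def variance_series(alpha, n_max=200):
    """Equation (3-5), truncated after n_max terms."""
    i0 = bessel_i(0, alpha)
    s = sum((-1) ** n * bessel_i(n, alpha) / (n * n * i0)
            for n in range(1, n_max + 1))
    return math.pi ** 2 / 3.0 + 4.0 * s

def variance_numeric(alpha, steps=20000):
    """Integral of phi**2 * p(phi) over (-pi, pi), midpoint rule."""
    i0 = bessel_i(0, alpha)
    dphi = 2.0 * math.pi / steps
    total = 0.0
    for i in range(steps):
        phi = -math.pi + (i + 0.5) * dphi
        total += phi * phi * math.exp(alpha * math.cos(phi))
    return total * dphi / (2.0 * math.pi * i0)

for alpha in (0.5, 5.0, 50.0):
    print(alpha, variance_series(alpha), variance_numeric(alpha), 1.0 / alpha)
```

The two computations agree, the small α values approach π²/3, and at large α the variance is close to 1/α, consistent with the behavior described for Figure 5.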


3.2 High Order Approximations 

The density in (3-4) is in theory valid only as α → ∞; it is
not obvious, however, how accurate this solution is for finite α.
In this section we investigate higher order truncations of the infinite
order equation in (2-45), and the associated solutions, in order to
obtain better approximations to the true solution. After integrating
(2-45) once with respect to φ, expanding the derivatives of sin φ p(φ),
and collecting like derivatives of p(φ), we have

    C_0 = \sum_{n=0}^{\infty} F_n(\phi)\, \frac{d^n p(\phi)}{d\phi^n} . (3-6)


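Here C_0 is the constant of integration and the F_n(φ) functions are of the
form

    F_0(\phi) = \sin\phi \Big( \alpha - \frac{1}{2\alpha} + \frac{1}{12\alpha^3}
                - \frac{1}{144\alpha^5} + \cdots \Big)

    F_1(\phi) = 1 + \cos\phi \Big( \frac{1}{\alpha} - \frac{1}{3\alpha^3}
                + \frac{1}{24\alpha^5} - \cdots \Big)

    F_2(\phi) = \sin\phi \Big( \frac{1}{2\alpha} - \frac{1}{2\alpha^3}
                + \frac{5}{48\alpha^5} - \cdots \Big)

    F_3(\phi) = \frac{1}{4\alpha^2} + \cos\phi \Big( \frac{1}{3\alpha^3}
                - \frac{5}{36\alpha^5} + \cdots \Big)                   (3-7)

    F_4(\phi) = \sin\phi \Big( \frac{1}{12\alpha^3} - \frac{5}{48\alpha^5}
                + \cdots \Big)

    F_5(\phi) = \frac{1}{36\alpha^4} + \cos\phi \Big( \frac{1}{24\alpha^5}
                - \cdots \Big)

    F_6(\phi) = \sin\phi \Big( \frac{1}{144\alpha^5} - \cdots \Big)

etc.

Figure 5. (Plot of the phase error variance versus α.)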
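
The numerical coefficients of the F_n(φ) can be regenerated mechanically. The sketch below is a check under stated assumptions rather than the report's own procedure: it assumes b = 1, takes the coefficient of the nth derivative term to be that of (2-44) divided by its n = 2 coefficient with ek = 2/α, integrates once, and expands the derivatives of sin φ p(φ) by the Leibniz rule. All function names are introduced here for illustration.

```python
from fractions import Fraction
from math import comb, factorial

def c_coeff(n):
    """C_n of (2-41): product of i/(i+1) over odd i from 1 to n (n odd)."""
    result = Fraction(1)
    for i in range(1, n + 1, 2):
        result *= Fraction(i, i + 1)
    return result

def series_coeff(n):
    """Coefficient of the n-th derivative term (b = 1), as (value, power).

    The term is value / alpha**power; it comes from dividing the n-th
    coefficient of (2-44) by the n = 2 coefficient and using ek = 2/alpha.
    """
    c = c_coeff(n if n % 2 else n - 1)
    return 4 * c * Fraction(2) ** (n - 2) / factorial(n), n - 2

# Sign of the j-th derivative of sin: sin, cos, -sin, -cos, repeating.
SIN_DERIVATIVE_SIGN = (1, 1, -1, -1)

def f_terms(k, n_max):
    """Return (constant, trig) for F_k, each a dict {power of 1/alpha: Fraction}.

    `trig` multiplies sin(phi) when k is even and cos(phi) when k is odd.
    Only derivative terms up to order n_max are included.
    """
    constant, trig = {}, {}
    for n in range(1, n_max + 1):
        value, power = series_coeff(n)
        if n % 2 == 0:
            if n - 1 == k:                 # even terms act on p alone
                constant[power] = value
        elif n - 1 >= k:                   # odd terms act on sin(phi) * p
            j = n - 1 - k                  # derivatives falling on sin(phi)
            trig[power] = SIN_DERIVATIVE_SIGN[j % 4] * comb(n - 1, k) * value
    return constant, trig

for k in range(7):
    print(k, f_terms(k, 7))
```

A power of -1 in the output denotes the term proportional to α itself. Extending `n_max` yields the further terms that are abbreviated by dots in (3-7).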


Note that the functions, F_n(φ), decrease with α (for α ≥ 1 and n ≥ 1)
and it is reasonable to assume that solutions to truncations involving
higher order terms of Equation (3-7) may yield higher order
approximations to the total solution of the infinite-order differential
equation. The solution to the truncated equation involving terms up to
and including the jth derivative of p(φ) will be called the jth-order
truncation solution. The function, F_n(φ), in general involves terms
derived from all the odd order derivatives of order ≥ n+1 in Equation
(3-6) operating on sin φ p(φ). Therefore, when forming the jth truncated
equation from Equation (3-6), the functions F_n(φ) must also be
appropriately truncated. For example, the solution to the Fokker-Planck
equation treated previously may also be called the first-order truncation
solution to Equation (3-6). Since, for a given α ≥ 1, the functions,
F_n(φ), decrease in magnitude rapidly with n it is reasonable to expect
that solutions (assuming they can be found) to increasingly higher-order
truncated equations would also reduce respectively the remainder,
when the higher-order truncation solutions are substituted for p(φ) in
Equation (3-6). This will be examined below as higher-order truncation
solutions are found.

A method exists for solving progressively higher-order truncated versions of Equation (3-6). It is shown in Ince [14], Boyce and DiPrima [15], and Coddington and Levinson [13] that the method of Frobenius, which assumes a series solution for $p(\phi)$ of the form

$$p(\phi) = \phi^m \sum_{n=0}^{\infty} A_n \phi^n, \qquad A_0 \ne 0 \tag{3-8}$$

is applicable to any-order truncated version of Equation (3-6), even (in theory) the total infinite-order equation. However, to solve exactly any $n$th-order truncated equation from Equation (3-6) it is necessary to have $n+1$ boundary conditions (recall that $C_0$ in Equation (3-6) is an unknown constant of integration). In addition to the boundary conditions previously introduced, further boundary conditions must therefore be specified in order to solve the higher order differential equations.

For the non-offset case, the primary assumption that will be imposed to evaluate the necessary boundary conditions is that the solutions to (3-6) are symmetric about $\phi = 0$. The solution is therefore an even function about $\phi = 0$, and between $-\pi$ and $\pi$ it can be expanded in a Fourier series as an infinite sum of cosines,

$$p(\phi) = \sum_{n=0}^{\infty} a_n \cos n\phi \tag{3-9}$$

where the $a_n$'s are coefficients. From this expression it can be seen that all odd order derivatives of $p(\phi)$ are zero at $\phi = 0$ and $\phi = \pm\pi$. Furthermore, evaluation of the right side of (3-6) at $\phi = 0$, with this zero condition for the odd derivatives, shows that $C_0$ is zero. In addition to these initial conditions, we shall further impose the restriction that all even order derivatives, evaluated at $\phi = \pm\pi$, will be zero also. This results in the set of boundary conditions:

$$\left. \frac{d^n p(\phi)}{d\phi^n} \right|_{\phi = \pm\pi} = 0, \qquad \text{for all } n \ge 1. \tag{3-10}$$

These conditions, along with the two used in (3-4), will provide a solution to any order truncation of (3-6). In the following sections, solutions to second order and third order truncated equations will be determined.

3.3 Second-Order Truncation Solution

The second-order truncation of Equation (3-6) becomes

$$\frac{\sin\phi}{2\alpha}\, p''(\phi) + \left( 1 + \frac{\cos\phi}{\alpha} \right) p'(\phi) + \left( \alpha - \frac{1}{2\alpha} \right) \sin\phi\; p(\phi) = 0. \tag{3-11}$$

The point $\phi = 0$ is a regular singular point of Equation (3-11) and therefore, by Theorem 4.3 of Boyce and DiPrima [15], a series solution exists of the form given by Equation (3-8) in either of the intervals $-\rho < \phi < 0$ or $0 < \phi < \rho$, where $\rho$ is some positive number. The value of $\rho$ is the radius of convergence of the series in Equation (3-8), and is at least equal to the distance from the origin to the nearest other zero of $\sin\phi/2\alpha$, which is at $\pi$. Hence, a series solution can be found for $\phi$ in the range $-\pi$ to $\pi$ for which the series converges.

By writing $\sin\phi$ and $\cos\phi$ in their series expansions, substituting Equation (3-8) into Equation (3-11), and collecting like powers of $\phi$, solutions for $m$ and $A_n$ can be found. Two solutions are found for $m$, one being zero and the other nonzero. Only the zero value for $m$ yields a non-trivial result, and the resulting values for $A_n$, $n$ even, are

$$A_n = \frac{(-1)^{\frac{n+2}{2}} \displaystyle\sum_{\substack{r=0 \\ (r\ \text{even})}}^{n-2} (-1)^{\frac{r}{2}} \left[ \frac{(r-1)r}{(n+1-r)!} + \frac{2r}{(n-r)!} - \frac{\beta}{(n-1-r)!} \right] A_r}{n(\lambda + 1 + n)} \tag{3-12}$$

where $\beta = 2\alpha^2 - 1$ and $\lambda = 2\alpha$. The $A_n$ for $n$ odd are all zero since the density is symmetrical. Therefore, for given values of $\alpha$, all the necessary coefficients $A_n$ can be calculated to solve for $p(\phi)$ in its series expansion. This was carried out on a digital computer for $\alpha$ equal to 1.5, 3, 10, and 30. The right half of the symmetrical density $p(\phi)$ in (3-8) is plotted in Figures 6, 7, 8, and 9 for these values, along with the solutions to the Fokker-Planck equation for the same $\alpha$. Note that the truncated solution converges rather quickly to the high density solution, and the two are practically equivalent for $\alpha \ge 3$. In essence, this can be conjectured as the range of validity of the high density solution.

The variance of the phase error, calculated from (3-8), is shown in Figure 10, along with the variance of the high density solution, Equation (3-5), and that satisfying a linear relation in $1/\alpha$. Again, the results indicate that for $\alpha \ge 2$ the relation in (3-5) is valid for the second order truncation solution as well.
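The series computation described in this section can be sketched as follows. This is an illustration rather than the program used for the report: it assumes the second-order truncated equation, multiplied through by $2\alpha$, reads $\sin\phi\, p'' + (\lambda + 2\cos\phi)\, p' + \beta\sin\phi\, p = 0$ with $\lambda = 2\alpha$ and $\beta = 2\alpha^2 - 1$, takes $A_0 = 1$ (normalization is left out), and uses illustrative function names.

```python
import math

def second_order_coeffs(alpha, n_terms=100):
    """Coefficients A_0..A_n_terms of p(phi) = sum A_n phi**n, with A_0 = 1 and odd A_n = 0."""
    lam, beta = 2.0 * alpha, 2.0 * alpha ** 2 - 1.0
    A = [0.0] * (n_terms + 1)
    A[0] = 1.0
    for n in range(2, n_terms + 1, 2):
        s = 0.0
        for r in range(0, n - 1, 2):               # r = 0, 2, ..., n-2
            bracket = (r * (r - 1) / math.factorial(n + 1 - r)
                       + 2 * r / math.factorial(n - r)
                       - beta / math.factorial(n - 1 - r))
            s += (-1) ** (r // 2) * bracket * A[r]
        A[n] = (-1) ** ((n + 2) // 2) * s / (n * (lam + 1 + n))
    return A

def series_eval(A, phi, deriv=0):
    """Evaluate the deriv-th derivative of sum A_n phi**n term by term."""
    total = 0.0
    for n in range(deriv, len(A)):
        factor = 1
        for j in range(deriv):                     # n (n-1) ... (n-deriv+1)
            factor *= n - j
        total += factor * A[n] * phi ** (n - deriv)
    return total

def ode_residual(alpha, A, phi):
    """Left side of the assumed second-order truncated equation (times 2*alpha) at phi."""
    lam, beta = 2.0 * alpha, 2.0 * alpha ** 2 - 1.0
    return (math.sin(phi) * series_eval(A, phi, 2)
            + (lam + 2.0 * math.cos(phi)) * series_eval(A, phi, 1)
            + beta * math.sin(phi) * series_eval(A, phi, 0))
```

A quick consistency check is that `ode_residual` stays near zero well inside the interval of convergence; dividing `series_eval(A, phi)` by its integral over $(-\pi, \pi)$ would then give the normalized density.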


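[Figures 6 through 9. Right half of the phase error density $p(\phi)$ versus $\phi$ (radians) for $\alpha$ = 1.5, 3, 10, and 30, respectively.]

[Figure 10. Variance of the phase error for the second-order truncation solution, the high density solution (3-5), and the linear relation in $1/\alpha$.]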


3.4 Third-Order Truncation Solution

The third-order truncation of Equation (3-6) gives

$$\frac{1}{4\alpha^2}\, p'''(\phi) + \frac{\sin\phi}{2\alpha}\, p''(\phi) + \left( 1 + \frac{\cos\phi}{\alpha} \right) p'(\phi) + \left( \alpha - \frac{1}{2\alpha} \right) \sin\phi\; p(\phi) = 0. \tag{3-13}$$

To solve this equation the previous series method is also used. However, $\phi = 0$ is no longer a singular point for this equation, and the series solution is simplified slightly to

$$p(\phi) = \sum_{n=0}^{\infty} A_n \phi^n, \qquad A_0 \ne 0. \tag{3-14}$$

The four boundary conditions used here are

i) $p(\pi) = p(-\pi)$

ii) $\int_{-\pi}^{\pi} p(\phi)\, d\phi = 1$

iii) $p'(\phi)\big|_{\phi=\pi} = 0$

iv) $p''(\phi)\big|_{\phi=\pi} = 0$.

Boundary conditions i) and iii) imply that all $A_n$ ($n$ odd) are equal to zero. Use of the same method as before to determine $A_n$ ($n$ even) yields the recurrence relation

$$A_n = \frac{\lambda}{n(n-1)(n-2)} \left\{ -(n-2)(n+\lambda-1)\, A_{n-2} + (-1)^{\frac{n+2}{2}} \sum_{\substack{r=0 \\ (r\ \text{even})}}^{n-4} (-1)^{\frac{r}{2}} \left[ \frac{-r(r-1)}{(n-r-1)!} - \frac{2r}{(n-r-2)!} + \frac{\theta}{(n-r-3)!} \right] A_r \right\} \tag{3-15}$$

for $n \ge 4$, with $\lambda = 2\alpha$ and $\theta = 2\alpha^2 - 1$. Boundary condition iv) is used to determine $A_2$. Substitution yields

$$\sum_{n=2}^{\infty} n(n-1)\, A_n\, \pi^{n-2} = 0. \tag{3-16}$$

n = 2 


Since all the $A_n$ ($n \ge 2$) can be written in terms of $A_0$ and $A_2$, Equation (3-16) can be written as

$$\left[ \sum_{n=2}^{\infty} n(n-1)\pi^{n-2} D_n \right] A_2 + \left[ \sum_{n=4}^{\infty} n(n-1)\pi^{n-2} B_n \right] A_0 = 0$$

where $D_n$ and $B_n$ can be determined from Equation (3-15). Then $A_2$ is

$$A_2 = -\left[ \frac{\displaystyle\sum_{n=4}^{\infty} n(n-1)\pi^{n-2} B_n}{\displaystyle\sum_{n=2}^{\infty} n(n-1)\pi^{n-2} D_n} \right] A_0 .$$

$A_0$ is then determined by the normalizing boundary condition ii),

$$\int_{-\pi}^{\pi} p(\phi)\, d\phi = 1 .$$


These computations were also accomplished with a digital computer, and the solutions for $p(\phi)$, $0 \le \phi \le \pi$, are plotted in Figures 6, 7, and 8 for $\alpha$ = 1.5, 3, and 10, respectively. For $\alpha \ge 1.5$, the third order truncation solution is almost identical to the second order solution.


3.5 Accuracy of the Truncation Solutions

The preceding methods can be used to solve higher-order truncations of Equation (3-6), but the derivation of the expressions for $A_n$ becomes increasingly difficult, and the computer time and memory required increase quite rapidly. Therefore truncation solutions of order higher than three were not attempted.

However, it would be of interest to obtain an indication of how well the truncation solutions approximate the true solution to (2-45). In particular, it is desirable to justify the notion that each successively higher-order truncation solution is a better approximation to the total solution. This requires that the truncation solutions be substituted into Equation (3-6) and that the magnitude of the remainder associated with the neglected higher order terms be investigated.

With this objective, the solutions obtained for the first-order (Fokker-Planck) and second-order truncations were substituted into Equation (3-6), and the maximum magnitude of each of the remaining terms was calculated on a computer. The results are plotted in Figure 11 for various values of $\alpha$. For example, when the first-order solution was used, the largest remainder was due to the second-order term, the next largest due to the third-order term, etc. In addition, the magnitude of the third-order term when the second-order solution is used is smaller than it was when the first-order solution was used.

It is clear from studying Figure 11 that successively higher-order truncation solutions result in smaller remainders, and therefore provide a more accurate approximation to the total solution. Note also that while Figure 11 plots the maximum magnitude of each term, the signs of the remainder terms alternate. Hence, the remainder appears as an alternating series of decreasing terms, and the magnitude of the remainder for an $n$th-order solution is bounded by the magnitude of the $(n+1)$th remainder term. For example, for $\alpha = 10$, the remainder for the Fokker-Planck solution is bounded by the second-order value of 0.26, and the remainder for the second-order solution is bounded by the third-order value of 0.086, down 67%.

[Figure 11. Maximum magnitude of the remainder terms versus $\alpha$.]

The data presented in Figures 6 through 11 indicate that, for the low function density case, higher-order truncation solutions to Equation (3-6) yield better approximations to the total solution of the infinite-order equation. It is also quite clear that as $\alpha$ increases all the truncation solutions approach the first-order (Fokker-Planck) solution. In other words, the $n$th-order truncation solution may be represented by

$$p_n(\phi) = p_1(\phi) + p_n^*(\phi, \alpha) \tag{3-17}$$

where $p_1(\phi)$ is the solution to the first-order (Fokker-Planck) equation in (3-4) and $p_n^*(\phi, \alpha)$ represents the difference between the $n$th-order and first-order truncation solutions. As $\alpha$ gets very large,

$$\lim_{\alpha \to \infty} p_n^*(\phi, \alpha) = 0, \qquad n > 1$$

and

$$\lim_{\alpha \to \infty} p_n(\phi) = p_1(\phi), \qquad n > 1 .$$

The method of solution that has been presented here can, in theory, reduce this error to as small a number as desired, given enough time and computer capacity. The third-order truncation solution was the highest order computed in this analysis, and it is shown that this solution is a good compromise in the tradeoff between accuracy and complexity of solution for the range $\alpha \ge 1.5$.


3.6 The VCO Offset Case

It has been assumed that the carrier frequency of the optical modulating signal, $\omega_s$, and the phase-locked-loop VCO rest frequency, $\omega_0$, are equal. When this is not the case the VCO offset, $(\omega_s - \omega_0)$, must be included in (2-37). This expression for $\Delta\phi$ is then modified to

$$\Delta\phi = (\omega_s - \omega_0)\, \Delta t - ek \sum_{m=1}^{N(\Delta t)} \cos\theta'(t_m), \qquad t \le t_m \le t + \Delta t. \tag{3-18}$$

The $K_n(\phi)$ coefficients in the Smoluchowski series equation are modified only through the first one, which becomes

$$K_1(\phi) = (\omega_s - \omega_0) - \tfrac{1}{2}\, eAk \sin\phi .$$

The effect of the VCO offset is that $p(\phi)$ is no longer symmetrical. This means that the series method of finding solutions to truncations of the infinite-order Smoluchowski equation now has the odd as well as the even terms in the power series solution for $p(\phi)$. In addition, the constant $C_0$ is no longer zero.

As an example of the treatment of the VCO offset case a second-order truncation solution will be found. The pertinent equation is a modified version of (3-11),

$$\frac{\sin\phi}{2\alpha}\, p''(\phi) + \left( 1 + \frac{\cos\phi}{\alpha} \right) p'(\phi) + \left[ \left( \alpha - \frac{1}{2\alpha} \right) \sin\phi - \gamma \right] p(\phi) = C_0 \tag{3-19}$$

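where

$$\gamma = \frac{8(\omega_s - \omega_0)}{k^2 (2\pi e^2 A)} \tag{3-20}$$

is the new parameter due to the offset. For a specific value of $\gamma$, (3-19) can be solved by the series method of the previous sections. For example, with $\gamma = (0.707)\alpha$, the coefficients in the series solution become

$$A_1 = \frac{\lambda C_0 + 0.3535\, \lambda^2 A_0}{\lambda + 2}$$

$$A_n\Big|_{n\ \text{even}} = \frac{0.3535\, \lambda^2 A_{n-1} + (-1)^{\frac{n+2}{2}} \displaystyle\sum_{\substack{r=0 \\ (r\ \text{even})}}^{n-2} (-1)^{\frac{r}{2}} \left[ \frac{(r-1)r}{(n-r+1)!} + \frac{2r}{(n-r)!} - \frac{\beta}{(n-r-1)!} \right] A_r}{n(n + \lambda + 1)}$$

$$A_n\Big|_{n\ \text{odd}} = \frac{0.3535\, \lambda^2 A_{n-1} + (-1)^{\frac{n+1}{2}} \displaystyle\sum_{\substack{r=1 \\ (r\ \text{odd})}}^{n-2} (-1)^{\frac{r-1}{2}} \left[ \frac{(r-1)r}{(n-r+1)!} + \frac{2r}{(n-r)!} - \frac{\beta}{(n-r-1)!} \right] A_r}{n(n + \lambda + 1)}$$

for $n \ge 2$, where $\lambda = 2\alpha$ and $\beta = 2\alpha^2 - 1$. The two unknown constants, $C_0$ and $A_0$, can be evaluated by using the two boundary conditions

i) $p(\pi) = p(-\pi)$

ii) $\int_{-\pi}^{\pi} p(\phi)\, d\phi = 1$.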


This was accomplished on a digital computer for $\alpha$ = 1.5 and 3, and the results are plotted in Figure 12 along with the first-order solution for $\alpha = 3$. The obvious difference between this case and the non-offset case is that the peak of the probability density has now shifted from the $\phi = 0$ center line.

[Figure 12. Phase error density $p(\phi)$ versus $\phi$ (radians) for the VCO offset case.]

The two solutions for $\alpha = 3$ show approximately the same relationship as in Figure 8 for the non-offset case.

Higher-order solutions can also be obtained, as in previous sections for the non-offset case, if additional boundary conditions are imposed to evaluate all the unknown constants of integration. The equivalent order offset solution, however, is obtained with more difficulty and complexity than in the non-offset case because $C_0$ is no longer zero.


Chapter 4

THERMAL NOISE AND PHOTOMULTIPLIER EFFECTS

In the previous chapters a first order phase locked loop driven by a shot noise process was considered. In this chapter we investigate the effects of additive thermal noise and of photomultiplier devices preceding the loop.

4.1 Additive Gaussian Thermal Noise

Let $r(t)$ represent a zero mean stationary Gaussian noise process having a flat one-sided power spectral density of $N_0$ watts/Hz. When $r(t)$ is added to the shot noise input process of the phase lock loop of Figure (1-2), the output of the loop filter [previously (2-6)] is now

$$e(t) = k \int_0^t f(t - \tau) \left[ \sum_{m=1}^{N(0,t)} e\, \delta(\tau - t_m) \cos\theta'(\tau) + r'(\tau) \right] d\tau \tag{4-1}$$

where $r'(t)$ is the "low frequency" equivalent noise process obtained by mixing the input noise $r(t)$ with the VCO process. It has been shown [1] that the new noise term is itself Gaussian, zero mean, with spectral density given by $N_0$ (i.e., $r'(t)$ is simply a "frequency shifted" version of $r(t)$).

When the transmitted phase variation, $\theta_1(t)$, is a constant, the phase error derivative for the first-order loop has the form

$$\frac{d\phi}{dt} = -k \left[ e \sum_{m=1}^{N(t)} \delta(t - t_m) \cos\theta'(t) + r'(t) \right]. \tag{4-2}$$

If this equation is integrated from $t$ to $t + \Delta t$, the incremental phase error becomes


$$\Delta\phi = -ek \sum_{m=1}^{N(\Delta t)} \cos\theta'(t_m) - k \int_t^{t+\Delta t} r'(\tau)\, d\tau . \tag{4-3}$$

The first term is identical to that previously derived in (2-37). The second term accounts for the added effect of the thermal noise. The coefficients of the Smoluchowski equation can now be recalculated for the $\Delta\phi$ of Equation (4-3). In particular, $K_1(\phi)$ remains the same as before:

$$K_1(\phi) = -\frac{eAk}{2} \sin\phi \tag{4-4}$$

since the expected value of the Gaussian process is zero. The second moment requires calculation of

$$E\left[ -ek \sum_{m=1}^{N(\Delta t)} \cos\theta'(t_m) - k \int_t^{t+\Delta t} r'(\tau)\, d\tau \right]^2 . \tag{4-5}$$

The expectation of the square of the first term has previously been calculated, the expectation of the cross term is zero, and the square of the second term contributes $k^2 N_0/2$. Therefore

$$K_2(\phi) = \frac{k^2}{2} \left[ e^2 A + N_0 \right]. \tag{4-6}$$
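The second moment can be illustrated with a small Monte Carlo experiment. The sketch below is only a plausibility check under simplifying assumptions that are not in the text: the electron arrivals form a constant-rate Poisson process of rate $A$, the time averaging over $\cos\theta'$ is replaced by drawing an independent uniform phase for each arrival, and the integrated thermal noise is drawn as a zero mean Gaussian with variance $N_0 \Delta t/2$. The function name and parameter values are illustrative.

```python
import math
import random

def estimate_k2(A, k, e, N0, dt=0.02, trials=20000, seed=1):
    """Monte Carlo estimate of E[(delta phi)^2] / dt for shot noise plus thermal noise."""
    rng = random.Random(seed)
    acc = 0.0
    for _ in range(trials):
        # Poisson arrivals in an interval of length dt, via exponential gaps.
        shot = 0.0
        t = rng.expovariate(A)
        while t < dt:
            shot += math.cos(rng.uniform(0.0, 2.0 * math.pi))
            t += rng.expovariate(A)
        # Integrated thermal noise over the same interval (assumed variance N0*dt/2).
        thermal = rng.gauss(0.0, math.sqrt(N0 * dt / 2.0))
        dphi = -e * k * shot - k * thermal
        acc += dphi * dphi
    return acc / (trials * dt)
```

For example, `estimate_k2(100.0, 1.0, 1.0, 50.0)` should land near $(k^2/2)[e^2A + N_0] = 75$ in these normalized units, up to sampling error.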

For computing the higher moments, define

$$P = e \sum_{m=1}^{N(\Delta t)} \cos\theta'(t_m), \qquad G = \int_t^{t+\Delta t} r'(\tau)\, d\tau .$$

Then,

$$E[P + G]^n = E\left( P^n + a_{n-1} P^{n-1} G + a_{n-2} P^{n-2} G^2 + a_{n-3} P^{n-3} G^3 + \cdots \right).$$

Since the Poisson and Gaussian processes are independent, this becomes

$$E[P + G]^n = E(P^n) + a_{n-1} E(P^{n-1}) E(G) + a_{n-2} E(P^{n-2}) E(G^2) + \cdots .$$

It has been shown [1] that for $n > 2$,

$$\lim_{\Delta t \to 0} \frac{1}{\Delta t} E(G^n) = 0 .$$

The expectation of $P^{n-m}$ has already been calculated in (2-33) and been found proportional to $\Delta t$. Thus

$$\lim_{\Delta t \to 0} \frac{1}{\Delta t}\, a_{n-m} E(P^{n-m}) E(G^m) = 0, \qquad m \ge 1,$$

and therefore,

$$K_n(\phi) = \lim_{\Delta t \to 0} \frac{E(P^n)}{\Delta t}, \qquad n > 2 \tag{4-7}$$

which is the same as in the earlier section when no additive Gaussian noise was present. Hence, the Smoluchowski series equation has been modified only in the second term, $K_2(\phi)$. The solution for the probability density of the phase error again requires solution of (2-45) with the appropriate modification. It has already been shown that an excellent approximate solution for the high $\alpha$ case is the solution to the Fokker-Planck equation. For the new $K_2(\phi)$ term this becomes

$$p(\phi) = \frac{\exp(\alpha \cos\phi)}{2\pi I_0(\alpha)} \tag{4-8}$$

where the parameter $\alpha$ is redefined as


$$\alpha = \frac{(eA)^2}{2 B_L \left[ e^2 A + N_0 \right]} . \tag{4-9}$$

Note that the parameter $\alpha$ now takes on a slightly different meaning. The bracketed denominator term is the sum of the spectral level due to the shot noise and the spectral level of the additive Gaussian noise. Hence, the denominator represents the total effective noise in the $2B_L$ loop bandwidth, due to both the shot noise and additive noise. The numerator is the average power of the intensity process. Thus, $\alpha$ now plays the role of an operating signal-to-noise power ratio in the tracking loop bandwidth. The dependence of $p(\phi)$ in (4-8) on $\alpha$ had been shown earlier in Figure 4, and the results there are valid with the above interpretation of $\alpha$.


4.2 Effect of Photomultiplication

In many optical systems photomultiplication is used at the photodetector to enhance the received signal. The objective of this section is to investigate the effects of photomultiplication on the behavior of the phase error in a first-order tracking system.

An ideal photomultiplier of gain $G$ has the property that it produces $G$ electrons at the photodetector output for each photoelectron at the input. If the electrons are considered identical this has the effect of producing an equivalent electron pulse waveform whose magnitude is $G$ times the magnitude of a single electron pulse waveform. Effectively, this increases the charge of a single electron by the gain $G$. The shot noise current of Equation (2-1) may then be written as

$$x(t) = \sum_{m=1}^{N(t)} (eG)\, \delta(t - t_m). \tag{4-10}$$

The pdf for the phase error in the high function density case is again given by (3-4), where the signal to noise ratio parameter $\alpha$ is now

$$\alpha = \frac{\tfrac{1}{2} (eGA)^2}{B_L \left[ (eG)^2 A + N_0 \right]} \tag{4-11}$$

and $B_L$ is now $eGAk/4$. The photomultiplication advantage is easily seen when the additive noise term of power spectrum level $N_0$ is dominant. In this case an increase in the $\alpha$ parameter can be achieved by increasing the photomultiplication gain $G$.

In the practical fabrication of photomultipliers the gain itself is often a random variable. In the following it is assumed that the photomultiplier has a statistically variable gain which is a random variable with mean $\bar{G}$ and mean square $\overline{G^2}$. This means that each electron at the input produces $G$ electrons at the output, where $G$ is a positive random variable. The shot noise current now becomes

$$x(t) = \sum_{m=1}^{N(t)} e\, G_m\, \delta(t - t_m) \tag{4-12}$$

where the $\{G_m\}$ constitute a set of random variables, independent and identically distributed over zero to infinity. The incremental change in the process is now

$$\Delta\phi = -ek \sum_{m=1}^{N(\Delta t)} G_m \cos\theta'(t_m) - k \int_t^{t+\Delta t} r'(\tau)\, d\tau$$

and the first two moments become

$$K_1(\phi) = -\frac{ek\bar{G}A}{2} \sin\phi$$

$$K_2(\phi) = \frac{k^2}{2} \left[ e^2\, \overline{G^2}\, A + N_0 \right].$$

The signal-to-noise ratio $\alpha$ is modified to

$$\alpha = \frac{\tfrac{1}{2} (e\bar{G}A)^2}{B_L \left[ e^2\, \overline{G^2}\, A + N_0 \right]} \qquad \text{with} \qquad B_L = \frac{e\bar{G}Ak}{4}. \tag{4-13}$$

Hence, the shot noise power spectrum is increased by the mean square of the gain, while the signal power is increased by the square of the mean gain.

In some analyses it is common to assume the random gain is Gaussian with a mean $\bar{G}$ and a standard deviation, or "spread", given as a fraction $\rho$ of the mean gain. That is,

$$\sigma_G = \rho\, \bar{G} .$$

In this case the mean square gain is

$$\overline{G^2} = (\bar{G})^2 (1 + \rho^2), \qquad 0 \le \rho < 1$$

and Equation (2-41) becomes

$$\alpha = \frac{\tfrac{1}{2} (e\bar{G}A)^2}{B_L \left[ e^2 (\bar{G})^2 (1 + \rho^2) A + N_0 \right]} .$$

Note that the $\alpha$ parameter degrades as the "spread" parameter $\rho$ increases.
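The dependence of the loop signal-to-noise parameter on thermal noise, mean gain, and gain spread can be tabulated with a few lines of code. This is a minimal sketch of the expressions in this chapter as read here, with $B_L = e\bar{G}Ak/4$ and mean square gain $(\bar{G})^2(1 + \rho^2)$; the function name, argument names, and the normalized units are assumptions made for illustration.

```python
def loop_snr(A, k, e, N0=0.0, mean_gain=1.0, spread=0.0):
    """alpha = (1/2)(e*Gbar*A)^2 / (B_L * [e^2 * E[G^2] * A + N0]), B_L = e*Gbar*A*k/4."""
    mean_square_gain = mean_gain ** 2 * (1.0 + spread ** 2)
    loop_bandwidth = e * mean_gain * A * k / 4.0
    noise_level = e ** 2 * mean_square_gain * A + N0
    return 0.5 * (e * mean_gain * A) ** 2 / (loop_bandwidth * noise_level)
```

With no thermal noise and unit gain the function reduces to $A/2B_L$, for instance `loop_snr(100.0, 0.5, 1.0)` gives 4.0. When `N0` dominates, raising `mean_gain` raises the result, and a nonzero `spread` lowers it.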


Chapter 5

SECOND-ORDER LOOP ANALYSIS AND
THE GENERAL TRACKING LOOPS

In this chapter the analysis of phase-locked loops with shot noise inputs is extended to loops of order greater than one, and to the first-order loop where the input and feedback functions are not necessarily sinusoidal (general tracking loops). For the second-order loop a vector form of the Smoluchowski equation is used for the phase error probability density, and the solution can be approximated under conditions similar to those of the first-order loop. For the general tracking loop, a generalized Smoluchowski equation for the probability density is used, and again can be solved by the numerical techniques presented in Chapter 3.

5.1 The Two Dimensional Smoluchowski Equation

The Smoluchowski-Kolmogorov probability density equation was derived in Chapter 2 for a general scalar random process $\phi(t)$. The same basic procedure can be repeated for a vector random process, and a similar vector form of (2-28) will result. Specifically, if we denote

$$\underline{\phi}(t) = \{ \phi_1(t), \phi_2(t) \} \tag{5-1}$$

as the two dimensional vector process having scalar random component processes $\{\phi_i(t)\}$, then the vector equivalent of (2-18) is

$$p(\underline{\phi}_1, t_1) = \int p(\underline{\phi}_1, t_1 \mid \underline{\phi}_2, t_2)\, p(\underline{\phi}_2, t_2)\, d\underline{\phi}_2 \tag{5-2}$$

where $\underline{\phi}_i = \{ \phi_1(t_i), \phi_2(t_i) \}$. Defining the two dimensional equivalent of the characteristic function in (2-19), and repeating the steps in (2-20) through (2-28), will yield the equation

$$\frac{\partial p(\underline{\phi}, t)}{\partial t} = \sum_m \sum_n \frac{(-1)^{m+n}}{m!\, n!}\, \frac{\partial^{m+n}}{\partial\phi_1^m\, \partial\phi_2^n} \left[ K_{mn}(\underline{\phi})\, p(\underline{\phi}, t) \right] \tag{5-3}$$

where

$$K_{mn}(\underline{\phi}) = \lim_{\Delta t \to 0} \frac{E\left[ (\Delta\phi_1)^m (\Delta\phi_2)^n \right]}{\Delta t}. \tag{5-4}$$

Equation (5-3) is just the two dimensional equivalent of the Smoluchowski equation in (2-28). Note that evaluation of the coefficients $\{K_{mn}\}$ requires all the statistical cross-moments of the joint variations $\Delta\phi_1$ and $\Delta\phi_2$ in the components of the process $\underline{\phi}(t)$. In the following section we apply (5-3) to a second order phase lock tracking loop.

5.2 Second Order Phase Lock Loops

A second order phase lock loop is one in which the loop filter in Figure 2 introduces an integration. The basic form of such a filter would be one having transfer function

$$F(s) = \frac{s + a}{s} = 1 + \frac{a}{s} \tag{5-5}$$

where $a$ represents a possible zero of transmission. The impulse response corresponding to (5-5) is then

$$f(t) = \delta(t) + a .$$

For a shot noise input, the general loop error dynamical equation in (2-9) now becomes

$$\frac{d\phi}{dt} = -ek \cos\theta'(t) \sum_{m=1}^{N(t)} \delta(t - t_m) - aek \sum_{m=1}^{N(t)} \cos\theta'(t_m). \tag{5-6}$$

We now see that the variation $\Delta\phi$ will contain a term in $\phi$ [as in (2-37)] and a term involving an integral in $\phi$, due to the second term in (5-6). It is therefore convenient to define the vector

$$\underline{y}(t) = \{ y_0(t), y_1(t) \} \tag{5-7}$$

where

$$y_1(t) = \frac{dy_0(t)}{dt} \tag{5-8}$$

and let

$$\phi(t) = y_0(t) + y_1(t). \tag{5-9}$$

That is, we consider the error process in (5-6) to be decomposed into the sum of the components of a vector process $\underline{y}(t)$. The probability density of $\phi(t)$ is then determined from the joint probability density $P(\underline{y}, t)$ by the relation

$$P(\phi, t) = \int \left[ P(\underline{y}, t) \right]_{y_0 = \phi - y_1} dy_1 . \tag{5-10}$$
Substitution of (5-8) and (5-9) into (5-6) yields

$$a \frac{dy_0(t)}{dt} + aek \sum_{m=1}^{N(t)} \cos\theta'(\underline{y}, t_m) + \frac{dy_1(t)}{dt} + ek \cos\theta'(\underline{y}, t) \sum_{m=1}^{N(t)} \delta(t - t_m) = 0 \tag{5-11}$$

where the dependence of $\theta'(t)$ on $\underline{y}(t)$ is emphasized. The above may be decomposed into the two equations

$$a \frac{dy_0(t)}{dt} + aek \sum_{m=1}^{N(t)} \cos\theta'(\underline{y}, t_m) = 0$$

$$\frac{dy_1(t)}{dt} + ek \cos\theta'(\underline{y}, t) \sum_{m=1}^{N(t)} \delta(t - t_m) = 0$$

where it is noted that the second equation is simply the derivative of the first. The above two equations may therefore be represented by the two first-order differential equations:

$$\frac{dy_0(t)}{dt} = y_1(t) \tag{5-12a}$$

$$\frac{dy_1(t)}{dt} = -ek \cos\theta'(\underline{y}, t) \sum_{m=1}^{N(t)} \delta(t - t_m). \tag{5-12b}$$

The above equations specify the dynamics of the vector process $\underline{y}(t)$ corresponding to (5-9). It is therefore possible to determine the equation for the joint density $P(\underline{y}, t)$ in (5-10) by using the two dimensional Smoluchowski equation in (5-3). The increments of the vector components given in (5-12) are

$$\Delta y_0 = y_1(t)\, \Delta t \tag{5-13a}$$

$$\Delta y_1 = -ek \int_t^{t+\Delta t} \cos\theta'(\underline{y}, \tau) \sum_{m=1}^{N(\tau)} \delta(\tau - t_m)\, d\tau = -ek \sum_{m=1}^{N(\Delta t)} \cos\theta'(\underline{y}, t_m). \tag{5-13b}$$

These increments are needed to calculate the joint moments, $K_{mn}(y_0, y_1)$, given by Equation (5-4). These are calculated to be

$$K_{10}(\underline{y}) = y_1$$

$$K_{01}(\underline{y}) = -\frac{ekA}{2} \sin(y_0 + y_1)$$

$$K_{02}(\underline{y}) = (ek)^2 A / 2$$

$$K_{0n}(\underline{y}) = (-ek)^n\, \overline{\cos^n\theta'(\underline{y}, t)\, n(t, \theta)}, \qquad n \ge 3$$

where again

$$n(t, \theta) = A + A \sin[\omega_s t + \theta_1(t)] .$$

All the other $K_{mn}(\underline{y})$ moments not listed above are zero. In this case the two-dimensional Smoluchowski series equation becomes

$$\frac{\partial p(\underline{y})}{\partial t} = -y_1 \frac{\partial p(\underline{y})}{\partial y_0} + \frac{ekA}{2} \frac{\partial}{\partial y_1} \left[ \sin(y_0 + y_1)\, p(\underline{y}) \right] + \frac{(ek)^2 A}{4} \frac{\partial^2 p(\underline{y})}{\partial y_1^2} + \sum_{n=3}^{\infty} \frac{(-1)^n}{n!} \frac{\partial^n}{\partial y_1^n} \left[ K_{0n}\, p(\underline{y}) \right]. \tag{5-14}$$

The above again represents an infinite order partial differential equation for the joint density $p(\underline{y}) = p(\underline{y}, t)$. Some simplicity is afforded by considering only the steady state solution, but the resulting equation is still difficult to solve explicitly without digital computation.

For the case where the average intensity $A$ is much greater than the loop bandwidth $B_L$ (i.e., large electron density) the approximate steady state solution to (5-14) can be found by limiting the number of terms involved. The corresponding steady state solution for $\phi(t)$ from (5-10) is then approximately,

$$P(\phi) \approx \frac{\exp(\alpha \cos\phi)}{2\pi I_0(\alpha)}, \qquad A \gg B_L \tag{5-15}$$

where $\alpha = A/2B_L$, but $B_L$ now has the definition

$$B_L = \frac{eAk + 2a}{4}. \tag{5-16}$$

That is, the loop bandwidth $B_L$ is increased by the added zero in the loop filter of (5-5). The high density solution for $P(\phi)$ is therefore identical to that of the first order loop case, with the adjustment in the $B_L$ bandwidth. For higher-order loops an equivalent $n$-dimensional vector process must be defined and an $n$-dimensional Smoluchowski equation must be derived, increasing the complexity of the problem.


5.3 General Delay Tracking Loops

The objective of this section is to investigate the behavior of a phase tracking system when the input intensity modulation signal and loop feedback function are of a general periodic nature, but not necessarily sinusoidal. Let the signal electron rate of Equation (2-2) be represented by $n_s(t, \tau_1(t))$ and the feedback function by $y(t, \tau_2(t))$, where $\tau_1(t)$ and $\tau_2(t)$ are their respective time delays. The differential equation describing the loop operation for shot noise inputs, where $\tau_1$ is again assumed constant, becomes

$$\frac{d\tau(t)}{dt} = -ek \sum_{m=1}^{N(t)} \delta(t - t_m)\, y(t, \tau_2(t)) \tag{5-17}$$

where $\tau(t) = \tau_1(t) - \tau_2(t)$ and $N(t)$ is again a Poisson random variable with intensity $n_s(t, \tau_1(t))$. The incremental delay error is

$$\Delta\tau = -ek \sum_{m=1}^{N(\Delta t)} y(t_m, \tau_2(t_m)) \tag{5-18}$$

and the $K_n(\tau)$ moments of the Smoluchowski series equation are now

$$K_n(\tau) = \lim_{\Delta t \to 0} \frac{1}{\Delta t}\, E\left[ -ek \sum_{m=1}^{N(\Delta t)} y(t_m, \tau_2(t_m)) \right]^n \tag{5-19}$$

where the expectation is conditioned on $\tau$. Thus, Equation (5-19) becomes

$$K_n(\tau) = (-ek)^n\, \overline{y^n(t, \tau_2(t))\, n(t, \tau_1)} \tag{5-20}$$

where the over-bar represents time averaging inherent in the loop mixer function. Hence, $K_n(\tau)$ is of the same form as in Equation (2-39), and would be identical to it if

$$n(t, \tau_1) = A + A \sin(\omega_0 t + \tau_1)$$

$$y(t, \tau_2(t)) \propto \partial n(t, \theta)/\partial\theta, \quad \text{i.e.} \quad y(t, \tau_2(t)) = \cos(\omega_0 t + \tau_2(t)) .$$

The third-order truncated Smoluchowski series equation for a general input function becomes

$$\frac{\partial p(\tau, t)}{\partial t} = ek \frac{\partial}{\partial\tau} \left[ \overline{y(t, \tau_2(t))\, n(t, \tau_1)}\; p(\tau, t) \right] + \frac{(ek)^2}{2} \frac{\partial^2}{\partial\tau^2} \left[ \overline{y^2(t, \tau_2(t))\, n(t, \tau_1)}\; p(\tau, t) \right] + \frac{(ek)^3}{3!} \frac{\partial^3}{\partial\tau^3} \left[ \overline{y^3(t, \tau_2(t))\, n(t, \tau_1)}\; p(\tau, t) \right]. \tag{5-21}$$

Here, attention is restricted again to only the first three terms of the infinite series equation, accepting the results as only an approximate solution.

The steady state version of Equation (5-21) occurs when the left-hand side is zero. Integrating the equation with respect to $\tau$ gives

$$C_0 = ek \left[ R_{yn}(\tau)\, p(\tau) \right] + \frac{(ek)^2}{2} \frac{d}{d\tau} \left[ R_{y^2 n}(\tau)\, p(\tau) \right] + \frac{(ek)^3}{3!} \frac{d^2}{d\tau^2} \left[ R_{y^3 n}(\tau)\, p(\tau) \right] \tag{5-22}$$

where

$$R_{yn}(\tau) = \overline{y\, n}, \qquad R_{y^2 n}(\tau) = \overline{y^2 n}, \qquad R_{y^3 n}(\tau) = \overline{y^3 n}$$

are correlation functions. Note that this equation corresponds to the previously considered equation (2-45) with the sinusoidal functions replaced by the general time averaged correlation functions given above.


5.4 An Example: Early-Late Gate Tracking

In radar and pulse tracking systems a periodic pulse train is locked to a locally generated periodic signal through a feedback tracking system, similar to that in Figure 2. When the two signals are in time lock, the local signal tracks the time variations in the arrival times of the incoming periodic pulse train. In optical tracking systems the pulse train is generated by a pulsed laser whose intensity is detected by a photodetector at the receiver. The feedback signal in the tracking loop is designed such that when it is multiplied with the detected pulse train and integrated over some period the result is an error function that is odd with respect to the time delay. This local signal is often designed to be a periodic train of positive-negative pulses as in Figure 13. The multiplication of the received pulse train by this particular local signal is equivalent to "gating in" the former signal by the latter signal. Hence, the receiver is often called an "early late gate" or "split-gate" tracker.

Since the output of a photodetector is a shot noise process with intensity $n(t, \tau_1)$, the analysis problem is an example of the application of the general tracking theory of the previous section. By referring to Figure 13 it is easily seen that

$$y^3(t, \tau_2(t)) = y(t, \tau_2(t)), \qquad y^2(t, \tau_2(t)) = |y(t, \tau_2(t))|$$

and therefore

$$R_{yn}(\tau) = R_{y^3 n}(\tau) = \frac{1}{T} \int_0^T y(t, \tau_2(t))\, n(t, \tau_1)\, dt \tag{5-23}$$

$$R_{y^2 n}(\tau) = \overline{y^2(t, \tau_2(t))\, n(t, \tau_1)} = \text{(a constant)}. \tag{5-24}$$

Equation (5-22) now becomes

$$C_0 = ek\, R_{yn}(\tau)\, p(\tau) + \frac{(ek)^2}{2} R_{y^2 n} \frac{dp(\tau)}{d\tau} + \frac{(ek)^3}{3!} \frac{d^2}{d\tau^2} \left[ R_{yn}(\tau)\, p(\tau) \right]. \tag{5-25}$$

[Figure 13. Input signal $n(t, \theta_1)$ and feedback signal $y(t, \theta_2(t))$ for the early-late gate tracker, each of period $T$.]

Therefore, for the given input and feedback functions in Figure 13, Equation (5-25) could be solved as a second-order differential equation for the delay error density of an "early late gate" tracking system. The solution would represent an approximate solution to the infinite series Smoluchowski equation. A computer solution similar to that used in Chapter 3 would be applicable to the solution of Equation (5-25).
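The odd error function produced by a split gate can be illustrated numerically. The sketch below is a toy example under stated assumptions, not the waveforms of Figure 13: the received intensity is taken as a rectangular pulse on a constant background, the gate is +1 on its early half and -1 on its late half, and all widths, levels, and function names are hypothetical.

```python
def pulse_intensity(t, tau1, width=0.1, peak=5.0, background=1.0, period=1.0):
    """Hypothetical received intensity n(t, tau1): rectangular pulse on a constant background."""
    d = (t - tau1) % period - period / 2.0      # offset from the pulse centre
    return background + (peak if abs(d) < width / 2.0 else 0.0)

def gate(t, tau2, width=0.1, period=1.0):
    """Hypothetical split-gate feedback y(t, tau2): +1 on the early half, -1 on the late half."""
    d = (t - tau2) % period - period / 2.0      # offset from the gate centre
    if -width < d < 0.0:
        return 1.0
    if 0.0 < d < width:
        return -1.0
    return 0.0

def error_curve(tau, samples=2000, period=1.0):
    """Time average of y(t, 0) * n(t, tau) over one period, by the midpoint rule."""
    total = 0.0
    for i in range(samples):
        t = (i + 0.5) * period / samples
        total += gate(t, 0.0) * pulse_intensity(t, tau)
    return total / samples
```

Evaluating `error_curve` over a range of delays traces the familiar S-shaped discriminator curve: it vanishes at zero delay error, is odd in the delay, and is unaffected by the constant background because the gate has zero mean.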


REFERENCES

1. Viterbi, A. J., Principles of Coherent Communication, McGraw-Hill, Inc. (1966).

2. Davenport, Jr., W. B., and W. L. Root, Random Signals and Noise, McGraw-Hill, Inc. (1958).

3. Middleton, D., Introduction to Statistical Communication Theory, McGraw-Hill, Inc. (1960).

4. Papoulis, A., Probability, Random Variables, and Stochastic Processes, McGraw-Hill, Inc. (1965).

5. O'Neill, E., Karp, S., and Gagliardi, R., "Communication Theory for the Free Space Optical Channel", Proceedings of the IEEE, October 1970.

6. Karp, S., "A Statistical Model for Radiation with Applications to Optical Communications", Ph.D. Dissertation, University of Southern California (January 1967).

7. Anderson, L. K., and B. J. McMurtry, "High-Speed Photodetectors", Proceedings of the IEEE, Volume 54, Number 10 (October 1966).

8. Reiffen, B., and H. Sherman, "An Optimum Demodulator for Poisson Processes: Photon Source Detectors", Proceedings of the IEEE (October 1963).

9. Bar-David, Israel, "Communication Under Poissonian Regime", Israel Ministry of Defense, Scientific Department, Report No. 40/07-526 (January 1968).

10. Parzen, E., Stochastic Processes, Holden Day, Inc. (1962).

11. Lindsey, William C., "Nonlinear Analysis and Synthesis of Generalized Tracking Systems", USCEE Technical Report 317 (December 1968).

12. Pratt, William K., Laser Communication Systems, John Wiley & Sons, Inc. (1969).

13. Coddington, Earl A., and Norman Levinson, Theory of Ordinary Differential Equations, McGraw-Hill Book Company, Inc. (1955).

14. Ince, E. L., Ordinary Differential Equations, Dover Publications, Inc. (1956).


- 68 - 


15. Boyce, William E. , and Richard C. DiPrima, Elementary 
Differential Equations and Boundary Value Problems, John 
Wiley and Sons , Inc., (1965). 

16. Kolmogorov, A, N. , "On Analytical Methods in Probability Theory 
Math. Ann., Vol. 104, pp. 415-458 (in German), (1931). 

17. Stratonovich, R. L. , Topics in the Theory of Random Noise , Vol. 
New York: Gordon and Breach, (1963). 




January 1971                                        USCEE Report


Interim Technical Report 

Counting Statistics for Extended 
Optical Photodetectors 



R. Gagliardi 
U. Farrukh 


Department of Electrical Engineering 
University of Southern California 
Los Angeles, California 90007 


This work was sponsored by the National Aeronautics and Space 
Administration, under NASA Contract NGR-05-018-104. This grant
was part of the research program at NASA's Goddard Space Flight 
Center, Greenbelt, Maryland. 




ABSTRACT 


Recent attention has been devoted to the derivation of the 
probability density of the number of photoelectrons occurring (count 
density) at the output of a photodetecting surface when receiving an 
optical field. Previous analysis has been basically confined to 
point detectors, in which the spatial extension of the detector 
surface over the normal optical field has been ignored. In this 
report count density analysis is generalized to account for the 
extended detector. The accepted mathematical detector model is 
developed to cover both time and spatial counting, and the integral 
equations necessary for exact analysis are developed. Approximate 
solutions to these equations are presented for some cases of 
practical interest. Knowledge of counting statistics is of utmost 
importance in the optimal design of an optical communication or 
tracking system. 




1.0 Introduction 


In optical systems, knowledge of the statistics of a photodetecting 
receiver is necessary for the application of optimum detection and estimation 
procedures. The statistic of most importance in an optical communication 
system is the probability of the number of photoelectrons produced at the 
photodetector output when receiving a statistical field. Although this 
detection operation is basically quantum mechanical in nature, the density of 
the electron occurrences can be theoretically derived by using a semi-classical
approach to the detector operation. This method can be used for developing 
probabilistic models while avoiding the basic physics underlying the receiver. 
In past work, the problem of determining photoelectron probabilities has been 
almost exclusively confined to photodetectors in which the spatial extension 
of the detector has been ignored, and only temporal effects have been 
developed. In this report the above approaches are extended to include the 
spatial effects of the detector during the photodetection operation. 




2.0 The Optical Photodetector 


An optical detector is a photosensitive surface that responds to incident 
optical radiation by releasing electrons. These electrons are captured by an
anode plate, producing an electron current. The release of the electrons is
strongly influenced by the incident optical field, but basically behaves as a
random phenomenon. The resulting electron current therefore evolves as a
random process, and is mathematically modeled as shot noise whose statistical 
behavior is directly related to the number of electrons produced at the anode. 
The number of electrons produced is called the electron "count" and its 
associated statistical properties are referred to as "count statistics". Of 
particular significance is the probability density of the number of electrons 
produced during a given time interval from the entire detector surface, when 
receiving a given optical field. 

The accepted mathematical model [1] of a photodetector is derived using
a semi-classical approach, which treats the electromagnetic field classically,
but prescribes a probabilistic solution to account for its interaction with
the atomic structure of the detector surface. Although a complete description
of the emission and absorption of light by an atom influenced by a radiation
field is well beyond the scope of this report, an outline of the approach as it 
is related to count statistics is presented below. 

The semi-classical derivation begins with the Hamiltonian equations for
a charged particle in an electromagnetic field. It is then assumed that the
combined system, atom plus radiation, begins in some initial state, and a set
of coupling equations is derived for the transition probabilities, from which
one can determine the probability rate of finding the combined system in a
given final state. Summing over all final states, and making some simplifying
assumptions, one ends with Fermi's rule for the probability per second for a
state transition over a differential area $\Delta\mathbf{r}$ located at point
$\mathbf{r}$ on the detector surface:

$$ \left[\text{Probability per second of a state transition over } \Delta\mathbf{r} \text{ at } \mathbf{r}\right] = \beta I(\mathbf{r},t)\,\Delta\mathbf{r} \qquad (1) $$

Here, $\beta$ is a proportionality constant and $I(\mathbf{r},t)$ is the received normal
electromagnetic power at time $t$ and point $\mathbf{r}$ on the detector surface. The
primary consequence of the Fermi rule is that it implies that in a short time
interval $\Delta t$, the probability of ejecting an electron from an atom at the
elemental surface area $\Delta\mathbf{r}$ is proportional to the incident radiation energy
over $\Delta\mathbf{r}$ and $\Delta t$. That is,

$$ \left[\text{Probability that an electron is released from area } (\mathbf{r}, \mathbf{r}+\Delta\mathbf{r}) \text{ during time interval } (t, t+\Delta t)\right] = \beta I(\mathbf{r},t)\,\Delta\mathbf{r}\,\Delta t \qquad (2) $$

for sufficiently small $\Delta\mathbf{r}$ and $\Delta t$. In addition, (2) implies that the
probability of more than one electron being released must go to zero as
$(\Delta\mathbf{r}\,\Delta t)^2$, which means

$$ \left[\text{Probability that no electron is released from area } (\mathbf{r}, \mathbf{r}+\Delta\mathbf{r}) \text{ during time interval } (t, t+\Delta t)\right] = 1 - \beta I(\mathbf{r},t)\,\Delta\mathbf{r}\,\Delta t \qquad (3) $$

as $\Delta\mathbf{r}\,\Delta t \to 0$. Note that (2) states that the release of an electron from any
elemental area at $\mathbf{r}$ at any time $t$ depends only upon the radiation energy at
that time and point, which implies that the release of electrons from disjoint
differential areas on the surface, and from disjoint time intervals, can be
treated as independent events. This assumption, along with Equations (1) and
(3), describes the mathematical model of the photodetecting surface, and will
be of primary importance in the subsequent derivation of the total electron
count.




3.0 Derivation of Count Densities 


The probability density of the number of electrons produced during a
time interval $(t, t+\tau)$ over a total surface area $A$ can be derived using
the photodetector model of the previous section. To facilitate this derivation,
we introduce the notion of a time-space domain. This domain contains vectors
whose components correspond to time and spatial coordinates associated with the
detector surface. For simplicity, we write these vectors as $\mathbf{v} = (t, \mathbf{r})$, where
$t$ is the scalar time component and $\mathbf{r}$ represents the two-dimensional spatial
coordinates of the detector surface. We define the volume $V$ in this domain
to be composed of all vectors $\mathbf{v}' = (t', \mathbf{r}')$ such that $t \le t' \le t+\tau$ and
$\mathbf{r}' \in A$, where $A$ is the spatial area encompassed by the detector surface
and $\tau$ is the counting time. This allows us to denote the normal
electromagnetic field intensity (power) at point $\mathbf{r}'$ on the detector surface at time
$t'$ by $I(t', \mathbf{r}') = I(\mathbf{v}')$. This means the volume $V$ is basically the set of all
points in the time-space domain over which we observe the radiation field with
a given detector in a given time interval.

Now consider the partition of the volume $V$ into disjoint cells
$\Delta V = \Delta\mathbf{r}\,\Delta t$. We shall assume $\Delta\mathbf{r}$ and $\Delta t$ are smaller than the spatial and time
variations in $I(t,\mathbf{r})$ to insure that within each $\Delta V$, $I(\mathbf{v})$ is approximately
constant. (This is always possible with continuous fields.) Let $\Delta v$ be the
volume of the cells $\Delta V$, and let $n$ be the total number of cells in $V$ after
partitioning. The ensemble of $n$ disjoint cells $\Delta V$ can now be ordered to
form the sequence $\{\Delta V_1, \Delta V_2, \ldots, \Delta V_n\}$ where each $\Delta V_i$ is centered at some point
$\mathbf{v}_i$ in $V$. Note that each $\Delta V_i$ can be interpreted as an observation cell
corresponding to an elemental surface area and elemental time interval over
which we observe the radiation field. In this notation, the model of the
detector in the previous section has the property:

$$ \left[\text{Probability of an electron emitted from } \Delta V_i \text{ at } \mathbf{v}_i\right] \approx \beta I(\mathbf{v}_i)\,\Delta v \qquad (4) $$


We now consider the probability of the detector releasing $k$ total
electrons from the total surface area $A$ during the total time interval
$(t, t+\tau)$. This is equivalent to the compound probability that $k$ electrons
are emitted from the totality of all cells $\{\Delta V_i\}$ spanning $V$. This can be
written as

$$ P(k; V) \approx \frac{1}{k!} \sum_{\text{all orderings}} \left[\text{Probability of one electron from } k \text{ different ordered cells}\right] \times \left[\text{Probability of no electrons in the } n-k \text{ remaining ordered cells}\right] $$

$$ = \frac{1}{k!} \sum_{\text{all orderings}} \beta^k I(\mathbf{v}_{i_1}) \cdots I(\mathbf{v}_{i_k})\,(\Delta v)^k \prod_{q=k+1}^{n} \left[1 - \beta I(\mathbf{v}_{i_q})\,\Delta v\right] \qquad (5) $$

where the summation must consider all possible orderings of the $n$ cells without
repeats, and the division by $k!$ is necessary since particular arrangements
involving the same $k$ cells need only be considered once. Note that (5) has
used (3), (4), and the assumption of independent electron emissions from
disjoint cells.




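We are interested in the limit of (5) as $\Delta v \to 0$, $n \to \infty$, so that the
approximation indicated becomes a true equality. Since the limit of a sum or
product equals the sum or product of the limits, we can investigate the
limit of the individual terms in (5) and recombine. We first show that the
product term has the same limit for all orderings. This can be seen by
considering the limit of the logarithm of the product. The log is

$$ \log \prod_{q=k+1}^{n} \left[1 - \beta I(\mathbf{v}_{i_q})\,\Delta v\right] = \sum_{q=k+1}^{n} \log\left[1 - \beta I(\mathbf{v}_{i_q})\,\Delta v\right] \qquad (6) $$

Adding and subtracting the terms omitted in each ordering allows us to always
write (6) as

$$ \sum_{q=1}^{n} \log\left[1 - \beta I(\mathbf{v}_q)\,\Delta v\right] - \sum_{q=1}^{k} \log\left[1 - \beta I(\mathbf{v}_{i_q})\,\Delta v\right] \qquad (7) $$

Now, in the limit as $\Delta v \to 0$, $n \to \infty$ and

$$ \log\left[1 - \beta I(\mathbf{v}_q)\,\Delta v\right] \to -\beta I(\mathbf{v}_q)\,\Delta v \quad \text{as } \Delta v \to 0 \qquad (8) $$

The first summation in (7) therefore has the limit

$$ \lim_{\Delta v \to 0,\; n \to \infty} \sum_{q=1}^{n} \log\left[1 - \beta I(\mathbf{v}_q)\,\Delta v\right] = \lim_{\Delta v \to 0,\; n \to \infty} \beta \sum_{q=1}^{n} \left[-I(\mathbf{v}_q)\,\Delta v\right] = -\beta \int_V I(\mathbf{v})\,d\mathbf{v} \qquad (9) $$

while the second summation in (7), which involves only a finite number of
terms, has the limit

$$ \lim_{\Delta v \to 0,\; n \to \infty} \; -\beta \sum_{q=1}^{k} I(\mathbf{v}_{i_q})\,\Delta v = 0 \qquad (10) $$

for each possible ordering. Hence, in (5)

$$ \lim_{\Delta v \to 0} \prod_{q=k+1}^{n} \left[1 - \beta I(\mathbf{v}_{i_q})\,\Delta v\right] = \exp\left[-\beta \int_V I(\mathbf{v})\,d\mathbf{v}\right] \qquad (11) $$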


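Now consider the summation term in (5):

$$ \lim_{\Delta v \to 0} \sum_{\text{all orderings}} \beta^k I(\mathbf{v}_{i_1}) \cdots I(\mathbf{v}_{i_k})\,(\Delta v)^k \qquad (12) $$

Since each ordering above requires $i_1 \ne i_2 \ne \cdots \ne i_k$, the summation can be
written as

$$ \sum_{i_1=1}^{n} \sum_{i_2=1}^{n} \cdots \sum_{i_k=1}^{n} \beta^k I(\mathbf{v}_{i_1}) \cdots I(\mathbf{v}_{i_k})\,(\Delta v)^k - \left[\text{Sum over all orderings in which at least two } i_q \text{ are equal}\right] \qquad (13) $$

The second term will involve $n^{k-1}$ terms of order $(\Delta v)^k$. Since $n$ behaves
as $1/\Delta v$, the limit of the second term will be zero as $\Delta v \to 0$. The first
term in (13), however, has the limit

$$ \lim_{\Delta v \to 0} \left[\beta \sum_{i=1}^{n} I(\mathbf{v}_i)\,\Delta v\right]^k = \left[\beta \int_V I(\mathbf{v})\,d\mathbf{v}\right]^k \qquad (14) $$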

Therefore, using (9), (10), and (14) in (5), we derive the desired probability
of $k$ emissions over $V$ as

$$ P(k; V) = \frac{(m_V)^k}{k!} \exp(-m_V) \qquad (15) $$

where

$$ m_V = \int_V n(t,\mathbf{r})\,dt\,d\mathbf{r} \qquad (16) $$

and

$$ n(t,\mathbf{r}) = \beta I(t,\mathbf{r}) = \beta\,|f(t,\mathbf{r})|^2 $$

The probability that exactly $k$ electrons will be emitted is therefore related
to the integral of the squared field envelope over the desired observation
volume. The parameter $k$ can represent any non-negative integer, and
therefore (15) represents a probability over all integers $(0, \infty)$. Note that the
normalized intensity $n(t,\mathbf{r})$ affects the count probability only through the
functional $m_V$. We have explicitly indicated the dependence of the probability
on the counting volume $V$, which in turn depends upon the location and size
of the detector area $A$, the counting interval $\tau$, and in particular on the
specific time parameter $t$. Thus, the probability in (15) is in general
non-stationary in time. This dependence upon time and spatial area $A$ will be
an important aspect in subsequent analysis.
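
The limiting argument behind (15) can be checked numerically. The following is a minimal sketch, not part of the original analysis: it assumes a one-dimensional observation volume V = [0, 1], a hypothetical intensity I(v) = 2 + cos(2*pi*v), and beta = 3 (all illustrative choices). It computes the exact count distribution of the independent-cell model in (4) and measures its total-variation distance to the Poisson law with level m_V, for a coarse and a fine partition.

```python
import math

def cell_count_pmf(intensities, beta, dv):
    """Exact pmf of the total count when every cell independently emits
    one electron with probability beta*I*dv and none otherwise."""
    pmf = [1.0]                        # before any cell: zero counts for sure
    for intensity in intensities:
        p = beta * intensity * dv      # emission probability of this cell, as in (4)
        nxt = [0.0] * (len(pmf) + 1)
        for k, w in enumerate(pmf):
            nxt[k] += w * (1.0 - p)    # this cell stays silent
            nxt[k + 1] += w * p        # this cell emits one electron
        pmf = nxt
    return pmf

def poisson_pmf(k, mean):
    """Poisson probability, evaluated in the log domain to avoid overflow."""
    return math.exp(-mean + k * math.log(mean) - math.lgamma(k + 1))

def distance_to_poisson(n_cells, beta=3.0):
    """Return (m_V, total-variation distance) for a partition into n_cells."""
    dv = 1.0 / n_cells
    # Hypothetical intensity I(v) = 2 + cos(2*pi*v), sampled at cell centres.
    intensities = [2.0 + math.cos(2.0 * math.pi * (i + 0.5) * dv)
                   for i in range(n_cells)]
    m_v = beta * sum(intensities) * dv
    pmf = cell_count_pmf(intensities, beta, dv)
    # The Poisson mass above n_cells counts is negligible here and is ignored.
    tv = 0.5 * sum(abs(p - poisson_pmf(k, m_v)) for k, p in enumerate(pmf))
    return m_v, tv

m_coarse, tv_coarse = distance_to_poisson(50)
m_fine, tv_fine = distance_to_poisson(800)
print(f"n =  50: m_V = {m_coarse:.4f}, distance to Poisson = {tv_coarse:.5f}")
print(f"n = 800: m_V = {m_fine:.4f}, distance to Poisson = {tv_fine:.5f}")
```

Under these assumptions the distance shrinks as the partition is refined, which is the behavior the limit in (5) relies on.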


If the optical field $f(t,\mathbf{r})$ is constant over $A$ at any time $t$
(i.e., $f(t,\mathbf{r}) = r_0 f(t)$ for all $\mathbf{r} \in A$), then the spatial effects can be removed,
and (16) becomes

$$ m_V = \beta \int_A \int_t^{t+\tau} |r_0 f(t)|^2\,dt\,d\mathbf{r} = \beta a r_0^2 \int_t^{t+\tau} |f(t)|^2\,dt \qquad (17) $$

where $a$ is the integrated area over the detector surface and $r_0^2$ is the
field power per unit area. The detector is said to be a point detector, since
it basically collects power at a particular point in space, normalized by area
power $a r_0^2$. This latter effect can be incorporated into the coefficient $\beta$,
which can now be redefined as

$$ \alpha \equiv a r_0^2 \beta \qquad (18) $$

Thus, when dealing with point detectors in (15), the previously defined
time-space intensity $n(t,\mathbf{r}) = \beta|f(t,\mathbf{r})|^2$ can be replaced by the time intensity
function $n(t) = \alpha|f(t)|^2$ in (16).




4.0 The Karhunen-Loeve Expansion of the Field

The key to further investigation of (15), using (16), depends upon the
ability to expand the radiation field $f(t,\mathbf{r})$ into an orthonormal Fourier
series over the observation volume $V$:

$$ f(\mathbf{v}) = \sum_{i=1}^{\infty} f_i\,\phi_i(\mathbf{v}) \qquad (19a) $$

where

$$ f_i = \int_V f(\mathbf{v})\,\phi_i^*(\mathbf{v})\,d\mathbf{v} = \int_A \int_t^{t+\tau} f(t,\mathbf{r})\,\phi_i^*(t,\mathbf{r})\,dt\,d\mathbf{r} \qquad (19b) $$

are the complex Fourier coefficients and $\{\phi_i(\mathbf{v})\}$ represents a complete set of
complex orthonormal basis functions over $\mathbf{v} \in V$. That is,

$$ \int_V \phi_i(\mathbf{v})\,\phi_j^*(\mathbf{v})\,d\mathbf{v} = \int_A \int_t^{t+\tau} \phi_i(t,\mathbf{r})\,\phi_j^*(t,\mathbf{r})\,dt\,d\mathbf{r} = \delta_{ij} \qquad (20) $$

If $f(t,\mathbf{r})$ is square integrable over $V$, then the equality in (19a) is in
the square-integrable sense, and the convergence requires only a bounded energy
constraint on the radiation field over $V$. When $f(\mathbf{v})$ is a stochastic field,
the coefficients $\{f_i\}$ become complex random variables, and the representation
in (19a) is in a mean square sense. In the latter case, if the orthonormal set
$\{\phi_i(\mathbf{v})\}$ is selected so that

$$ \int_V K_f(\mathbf{v}_1, \mathbf{v}_2)\,\phi_i(\mathbf{v}_1)\,d\mathbf{v}_1 = \gamma_i\,\phi_i(\mathbf{v}_2) \qquad (21) $$

where the kernel

$$ K_f(\mathbf{v}_1, \mathbf{v}_2) = E\left[f^*(\mathbf{v}_1)\,f(\mathbf{v}_2)\right] \qquad (22) $$

then the random coefficients $\{f_i\}$ are uncorrelated, and the expansion in
(19a) is called a Karhunen-Loeve (K-L) expansion. The function $K_f(\mathbf{v}_1, \mathbf{v}_2)$
is the covariance function of the radiation field, and is obtained by averaging
in (22) over the statistics of the field. The convergence in (22) then requires
only the square-integrability and continuity of the covariance function
$K_f(\mathbf{v}_1, \mathbf{v}_2)$ over $V$. The orthonormal functions $\{\phi_i(\mathbf{v})\}$ that are solutions to the
integral equation in (21) are called eigenfunctions of the integral operator
with kernel $K_f(\mathbf{v}_1, \mathbf{v}_2)$, and the $\gamma_i$ are the associated eigenvalues. These
eigenvalues of the K-L expansion are particularly important to our interests
since

$$ E|f_i|^2 = E\left[\int_V f(\mathbf{v}_1)\,\phi_i^*(\mathbf{v}_1)\,d\mathbf{v}_1 \int_V f^*(\mathbf{v}_2)\,\phi_i(\mathbf{v}_2)\,d\mathbf{v}_2\right] $$

$$ = \int_V \int_V K_f(\mathbf{v}_2, \mathbf{v}_1)\,\phi_i(\mathbf{v}_2)\,\phi_i^*(\mathbf{v}_1)\,d\mathbf{v}_2\,d\mathbf{v}_1 $$

$$ = \gamma_i \int_V |\phi_i(\mathbf{v}_1)|^2\,d\mathbf{v}_1 = \gamma_i \qquad (23) $$

where we have used (19b), and the fact that the $\{\phi_i\}$ are orthonormal over $V$.
Thus, the eigenvalue $\gamma_i$ is the mean square value of the random variable $f_i$.
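
A discretized version of the eigenfunction equation (21) makes these properties concrete. The sketch below is illustrative only: it assumes a purely temporal field on [0, T] with a hypothetical real exponential covariance K(t1, t2) = exp(-|t1 - t2| / tau_c), replaces the integral by a midpoint sum, and uses NumPy's symmetric eigensolver. It then checks that the sampled eigenfunctions are orthonormal as in (20), that the projected coefficients are uncorrelated with mean square values equal to the eigenvalues as in (23), and that the eigenvalues sum to the integral of K(t, t) over the interval.

```python
import numpy as np

# Hypothetical parameters: interval length, grid size, coherence time.
T, n, tau_c = 1.0, 200, 0.2
dt = T / n
t = (np.arange(n) + 0.5) * dt                       # midpoint grid on [0, T]
K = np.exp(-np.abs(t[:, None] - t[None, :]) / tau_c)

# Midpoint-rule version of (21): (K * dt) @ phi = gamma * phi.
gammas, vecs = np.linalg.eigh(K * dt)
gammas, vecs = gammas[::-1], vecs[:, ::-1]          # largest eigenvalue first
phi = vecs / np.sqrt(dt)                            # columns sample the eigenfunctions

# Orthonormality (20): integral of phi_i * phi_j over [0, T].
gram = phi.T @ phi * dt

# Covariance of the coefficients f_i = integral f(t) phi_i(t) dt:
# E[f_i f_j] = double integral of K(t1, t2) phi_i(t1) phi_j(t2).
coeff_cov = (phi.T * dt) @ K @ (phi * dt)

print("largest eigenvalues:", np.round(gammas[:4], 4))
print("sum of eigenvalues :", round(float(gammas.sum()), 6))
```

In this discretization the coefficient covariance matrix is diagonal with the eigenvalues on the diagonal, which is the uncorrelatedness property that defines the K-L expansion.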

With the K-L expansion in (19) we can now substitute into (16) and
derive

$$ m_V = \int_V \beta\,|f(\mathbf{v})|^2\,d\mathbf{v} = \beta \int_V \left[\sum_{i=1}^{\infty} f_i\,\phi_i(\mathbf{v})\right] \left[\sum_{j=1}^{\infty} f_j^*\,\phi_j^*(\mathbf{v})\right] d\mathbf{v} = \beta \sum_{i=1}^{\infty} \sum_{j=1}^{\infty} f_i f_j^* \int_V \phi_i(\mathbf{v})\,\phi_j^*(\mathbf{v})\,d\mathbf{v} \qquad (24) $$

where the interchange of summation and integration is guaranteed by the
convergence of the series. The orthonormality of the basis functions then yields

$$ m_V = \beta \sum_{i=1}^{\infty} |f_i|^2 \qquad (25) $$

Thus, we have expressed the counting level as the sum of the magnitude squared
of the coefficients. Note that the particular observation volume $V$ over
which we collect radiation, and which depends upon the detector area $A$ and
the counting time interval $(t, t+\tau)$, is implicit in the computation of the
coefficients on the right.

The expansion of $m_V$ in (25) as a sum of random variables affords
us a convenient systematic approach for determining the count probabilities in
(15). We can either attempt to compute the probability density of the sum in
(25), and evaluate (15), or alternatively, compute the characteristic function
$\phi_m(\omega)$ of the sum in (25), and evaluate $P(k; V)$ by [2]

                                                                    (26)

The computation of the characteristic function $\phi_m(\omega)$, however, requires joint
statistics over all the $\{f_i\}$. (The K-L expansion produces uncorrelated
coefficients, but the joint density is still needed.) By considering only
Gaussian fields, this latter obstacle is avoided. This is due to the fact that
uncorrelated Gaussian random variables are independent. A Gaussian field is a
stochastic field $f(t,\mathbf{r})$ that is a Gaussian random variable at every $t$ and
$\mathbf{r}$. For Gaussian fields, the K-L expansion has the added feature of producing
independent Gaussian random coefficients $f_i$ in (19). It then follows that

$$ \phi_m(\omega) = \prod_{i=1}^{\infty} \frac{1}{1 - \beta\gamma_i\omega} \exp\left[\frac{\beta\,|s_i|^2\,\omega}{1 - \beta\gamma_i\omega}\right] \qquad (27) $$

where $s_i$ is the mean (signal) value of $f_i$ and $\gamma_i$ is the eigenvalue in (21).
Note the above characteristic function appears as a product of individual
characteristic functions. This means it can be interpreted as the
characteristic function of a sum of random variables $\{k_i\}$, where each $k_i$ has the
probability density


$$ P(k_i) = \frac{(\beta\gamma_i)^{k_i}}{(1 + \beta\gamma_i)^{1+k_i}} \exp\left[-\frac{\beta\,|s_i|^2}{1 + \beta\gamma_i}\right] L_{k_i}\left[-\frac{|s_i|^2}{\gamma_i (1 + \beta\gamma_i)}\right] \qquad (28) $$

and $L_{k_i}(\cdot)$ is the $k_i$-th order Laguerre polynomial in argument $(\cdot)$. Thus, for
stochastic Gaussian fields, the count statistic $K$ can be interpreted as the
sum

$$ K = \sum_{i=1}^{\infty} k_i \qquad (29) $$

of independent random counts, where each $k_i$ has a Laguerre count
probability given by (28). Each $k_i$ can be considered the random count associated
with a time-space mode of the Gaussian field. The modes, however, have specific
meaning, since they must be associated with the particular orthonormal set of
the K-L expansion. Each such mode contributes an independent Laguerre count
variable to the total count. Note that if $s_i = 0$ for some $i$, implying no
deterministic component in that particular mode, (28) becomes

$$ P(k_i) = \frac{(\beta\gamma_i)^{k_i}}{(1 + \beta\gamma_i)^{1+k_i}} \qquad (29) $$

which is the Bose-Einstein probability. Hence, any mode of the Gaussian field
that does not contain a signal component (zero mean value) contributes a
Bose-Einstein count variable to $K$ in (29). We emphasize that the Laguerre densities
in (28), and the Bose-Einstein densities in (29), require knowledge of the
eigenvalues $\{\gamma_i\}$, which in turn require solution of the multi-dimensional integral
equation (21) associated with the K-L expansion. For certain special types
of stochastic Gaussian fields, the solution is somewhat simplified, as
discussed below.
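
The single-mode count densities are easy to evaluate numerically. The following sketch is illustrative rather than taken from the report: it assumes the Laguerre form of (28) with the exponent -beta|s|^2 / (1 + beta*gamma) and Laguerre argument -|s|^2 / (gamma (1 + beta*gamma)), uses hypothetical values beta = 1, gamma = 2, |s|^2 = 3, and evaluates the Laguerre polynomial with the standard three-term recurrence. It checks that the density sums to one and that it collapses to the Bose-Einstein law when the signal component vanishes.

```python
import math

def laguerre(k, x):
    """Laguerre polynomial L_k(x) from the standard three-term recurrence
    (n + 1) L_{n+1} = (2n + 1 - x) L_n - n L_{n-1}."""
    prev, cur = 1.0, 1.0 - x           # L_0 and L_1
    if k == 0:
        return prev
    for n in range(1, k):
        prev, cur = cur, ((2 * n + 1 - x) * cur - n * prev) / (n + 1)
    return cur

def laguerre_count_pmf(k, beta, gamma, s_abs2):
    """Single-mode count probability in the Laguerre form assumed above."""
    bg = beta * gamma
    return ((bg / (1.0 + bg)) ** k / (1.0 + bg)
            * math.exp(-beta * s_abs2 / (1.0 + bg))
            * laguerre(k, -s_abs2 / (gamma * (1.0 + bg))))

def bose_einstein_pmf(k, beta, gamma):
    """Single-mode count probability for a mode with no signal component."""
    bg = beta * gamma
    return (bg / (1.0 + bg)) ** k / (1.0 + bg)

# Hypothetical mode parameters chosen only for illustration.
beta, gamma, s_abs2 = 1.0, 2.0, 3.0
pmf = [laguerre_count_pmf(k, beta, gamma, s_abs2) for k in range(400)]
total = sum(pmf)
mean = sum(k * p for k, p in enumerate(pmf))
print(f"sum of probabilities = {total:.12f}")
print(f"mean count           = {mean:.9f}")
print(f"beta*(gamma + |s|^2) = {beta * (gamma + s_abs2):.9f}")
```

With these values the mean count matches beta(gamma + |s|^2), i.e. the noise and signal contributions of the mode add, provided gamma is read as the variance of the coefficient about its mean s.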




5.0 Stationary, Homogeneous, and Coherence-
Separable Stochastic Fields

A stochastic field is said to be temporally stationary if the time
dependence in the coherence function $K_f(\mathbf{r}_1, \mathbf{r}_2; t_1, t_2)$ depends only upon the time
difference $t_1 - t_2$. The field is spatially homogeneous if the spatial dependence
in the coherence function depends only upon the spatial distance $(\mathbf{r}_1 - \mathbf{r}_2)$. A
field is completely homogeneous if it is both temporally stationary and
spatially homogeneous.

A stochastic field is said to be a coherence-separable field if its
coherence function factors as

$$ K_f(\mathbf{r}_1, \mathbf{r}_2; t_1, t_2) = K_s(\mathbf{r}_1, \mathbf{r}_2)\,K_t(t_1, t_2) \qquad (30) $$

The factor $K_s(\mathbf{r}_1, \mathbf{r}_2)$ is then called the spatial (mutual) coherence function,
while $K_t(t_1, t_2)$ is called the covariance function of the field. A field that
is coherence-separable and temporally stationary [i.e., $K_t(t_1, t_2) = K_t(t_1 - t_2)$]
is said to be a spectrally pure field.

An optical field is completely space coherent over an area $A$ if
$K_s(\mathbf{r}_1, \mathbf{r}_2) = 1$ for all $\mathbf{r}_1, \mathbf{r}_2$ in $A$. It is completely space incoherent if
$K_s(\mathbf{r}_1, \mathbf{r}_2) = 0$, $\mathbf{r}_1 \ne \mathbf{r}_2$. Otherwise, it is partially space coherent. For
coherence-separable fields, the K-L expansion has the property that the left
side of the eigenfunction equation in (21) becomes

$$ \int_A \int_t^{t+\tau} K_s(\mathbf{r}_1, \mathbf{r}_2)\,K_t(t_1, t_2)\,\phi_i(t_1, \mathbf{r}_1)\,dt_1\,d\mathbf{r}_1 = \int_A K_s(\mathbf{r}_1, \mathbf{r}_2) \left[\int_t^{t+\tau} K_t(t_1, t_2)\,\phi_i(t_1, \mathbf{r}_1)\,dt_1\right] d\mathbf{r}_1 \qquad (31) $$

The form of this shows that an eigenfunction solution to (21) will occur as

$$ \phi_{ij}(t, \mathbf{r}) = g_i(t)\,h_j(\mathbf{r}) \qquad (32) $$

$$ \gamma_{ij} = \gamma_{it}\,\gamma_{js} \qquad (33) $$

where the above terms satisfy

$$ \int_t^{t+\tau} K_t(t_1, t_2)\,g_i(t_1)\,dt_1 = \gamma_{it}\,g_i(t_2) \qquad (34) $$

$$ \int_A K_s(\mathbf{r}_1, \mathbf{r}_2)\,h_j(\mathbf{r}_1)\,d\mathbf{r}_1 = \gamma_{js}\,h_j(\mathbf{r}_2) \qquad (35) $$

That is, for coherence-separable stochastic fields the eigenfunctions and
eigenvalues will factor into a product of time and spatial components. Thus, the
eigenfunctions over $V$ in (21) can be determined by separately determining the
family of eigenfunctions for both the time and space coherence kernels in (34)
and (35). Since the product of any two such time and space eigenfunctions will
satisfy (31), the set of all eigenfunctions $\{\phi_i(\mathbf{v})\}$ in (19) must involve all
possible pairwise products of the $\{g_i(t)\}$ and $\{h_j(\mathbf{r})\}$. The K-L expansion
then takes the form

$$ f(t, \mathbf{r}) = \sum_{i=1}^{\infty} \sum_{j=1}^{\infty} f_{ij}\,g_i(t)\,h_j(\mathbf{r}) \qquad (36a) $$

where

$$ f_{ij} = \int_A \int_t^{t+\tau} f(t, \mathbf{r})\,g_i^*(t)\,h_j^*(\mathbf{r})\,dt\,d\mathbf{r} \qquad (36b) $$

The functions $\{g_i(t)\}$ define the temporal modes of the received field, while
the functions $\{h_j(\mathbf{r})\}$ designate its spatial modes. The corresponding $\{\gamma_{it}\}$
and $\{\gamma_{js}\}$ eigenvalues are the mean square power in each mode. If the field is
completely spatially coherent over $A$, such that $K_s(\mathbf{r}_1, \mathbf{r}_2) = r_0^2$ over all
$\mathbf{r}_1, \mathbf{r}_2$ in $A$, then (35) is satisfied with the single eigenfunction $h(\mathbf{r}) = 1$
and eigenvalue $\gamma_s = r_0^2 a$. Thus, the field will have only one spatial mode in
$A$. Substitution into (36) also shows that $f(t, \mathbf{r}) = r_0^2 a f(t)$, as was assumed
in the definition of the point detector in Section 3. The point detector
assumption therefore is valid for stochastic fields whenever the field is
completely spatially coherent over the detector area, or equivalently, when only one
spatial mode exists.
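
The factoring of eigenvalues for a coherence-separable field can be illustrated with a small discretized example. This is a sketch under stated assumptions: the spatial coordinate is taken as one-dimensional for brevity, both kernels are hypothetical Gaussian-shaped coherence functions, and integrals are replaced by midpoint sums. On a product grid the separable kernel K_s K_t becomes a Kronecker product, and its eigenvalues should equal all pairwise products of the temporal and spatial eigenvalues.

```python
import numpy as np

def gaussian_kernel(points, width):
    """Hypothetical coherence kernel exp(-(x1 - x2)^2 / width^2) on a grid."""
    d = points[:, None] - points[None, :]
    return np.exp(-(d / width) ** 2)

nt, nr = 12, 9                                   # temporal and spatial grid sizes
t = (np.arange(nt) + 0.5) / nt                   # time samples on [0, 1]
r = (np.arange(nr) + 0.5) / nr                   # 1-D "detector" samples on [0, 1]

Kt = gaussian_kernel(t, 0.3) / nt                # temporal kernel times dt
Ks = gaussian_kernel(r, 0.5) / nr                # spatial kernel times dr

gamma_t = np.linalg.eigvalsh(Kt)                 # discretized temporal eigenvalues
gamma_s = np.linalg.eigvalsh(Ks)                 # discretized spatial eigenvalues

# Separable kernel on the product grid and its eigenvalues.
gamma_joint = np.sort(np.linalg.eigvalsh(np.kron(Ks, Kt)))

# All pairwise products of spatial and temporal eigenvalues.
gamma_products = np.sort(np.outer(gamma_s, gamma_t).ravel())

print("largest joint eigenvalue   :", gamma_joint[-1])
print("largest product eigenvalue :", gamma_products[-1])
```

The practical consequence is the one stated in the text: the temporal and spatial integral equations can be solved separately, and the full set of modes is obtained by pairing their solutions.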




6.0 Solutions of the Spatial Integral Equation 


Computation of the count statistics requires the solution to the integral
equations in (34) and (35). Both equations are basically identical in form,
although the former involves the scalar parameter $t$, while the latter involves
the space vector $\mathbf{r}$. Solutions to the time equation (34) have been considered
in the literature [2]. In this section we derive some approximate solutions for
the spatial equation in (35).


A. Rectangular Detector 

Consider a rectangular photodetector of length $a$, width $b$, and
area $A = ab$, receiving a coherence-separable field from a source at a
distance $R$. Using the Fresnel-Kirchhoff approximation, the spatial coherence
function over the detector surface $A$ is given by*

$$ K_s(\mathbf{r}_1, \mathbf{r}_2) = [A\,\mathcal{B}(0)]^{-1}\,\mathcal{B}(\mathbf{r}_2 - \mathbf{r}_1) \exp\left[j\pi\left(|\mathbf{r}_1|^2 - |\mathbf{r}_2|^2\right)/\lambda R\right] \qquad (37) $$

where $\lambda$ is the wavelength, $A$ is the area of the detector, and $\mathcal{B}(\mathbf{r})$ has a
Fourier transform called the source radiance $B(\mathbf{u})$:

$$ \mathcal{B}(\mathbf{r}) = \int B(\mathbf{u}) \exp\left[j 2\pi (\mathbf{u} \cdot \mathbf{r})/\lambda R\right] d\mathbf{u} \qquad (38) $$

Here the integration is over the two-dimensional space of $\mathbf{u}$ vectors.

* Helstrom, C. W., "Detection of Objects Through Turbulent Medium",
  J. Opt. Soc. Am., Vol. 59, p. 333, March 1969.

By writing the eigenfunctions in (35) as

$$ h_j(\mathbf{r}) = \tilde{h}_j(\mathbf{r}) \exp\left[-j\pi |\mathbf{r}|^2/\lambda R\right] \qquad (39) $$

we can rewrite the spatial integral equation in (35) as

$$ \gamma_{js}\,\tilde{h}_j(\mathbf{r}_2) = [A\,\mathcal{B}(0)]^{-1} \int_A \mathcal{B}(\mathbf{r}_2 - \mathbf{r}_1)\,\tilde{h}_j(\mathbf{r}_1)\,d\mathbf{r}_1 \qquad (40) $$

Now, if the source subtends from the detector a solid angle much less than
$\lambda^2/A$, then $\mathcal{B}(\mathbf{r}) \approx \mathcal{B}(0)$ over the detector area, and the detected field
possesses complete coherency. Thus, the point detector model of the previous
section is valid, and the source can be considered a point source. When the
source is so large that its solid angle, when viewed from the detector, spans
many multiples of $\lambda^2/A$, its coherence function over the detector surface
area is generally smaller in area than the detector itself. For the rectangular
detector, the eigenfunctions in (40) can be approximated by two-dimensional
spatial plane waves

$$ \tilde{h}_j(\mathbf{r}) = \exp\left[j 2\pi q x/a + j 2\pi \ell y/b\right] \qquad (41) $$

where $x$ and $y$ are components of $\mathbf{r}$, and $q$, $\ell$ are integers. That is,
the eigenfunctions can be taken as spatial harmonics, with spatial frequencies
that are integer multiples of $1/a$ and $1/b$. If we substitute with (38) and
(41), the right side of (40) becomes

$$ [A\,\mathcal{B}(0)]^{-1} \int B(\mathbf{u}) \exp\left[j 2\pi (u_1 x_2 + u_2 y_2)/\lambda R\right] \left\{ \int_A \exp\left[j 2\pi \left(\frac{q}{a} - \frac{u_1}{\lambda R}\right) x_1 + j 2\pi \left(\frac{\ell}{b} - \frac{u_2}{\lambda R}\right) y_1\right] dx_1\,dy_1 \right\} d\mathbf{u} \qquad (42) $$

where $u_1$ and $u_2$ are the components of $\mathbf{u}$. The right hand integral integrates
to approximately

$$ (\lambda R)^2\,\delta\!\left(u_1 - \frac{q\lambda R}{a}\right) \delta\!\left(u_2 - \frac{\ell\lambda R}{b}\right) \qquad (43) $$

so that (42) becomes, approximately,

$$ \frac{\lambda^2 R^2}{A\,\mathcal{B}(0)}\,B\!\left(\frac{q\lambda R}{a}, \frac{\ell\lambda R}{b}\right) \exp\left[j\frac{2\pi q}{a} x_2 + j\frac{2\pi \ell}{b} y_2\right] \qquad (44) $$

where $B(u_1, u_2) = B(\mathbf{u})$. Thus, substituting (44) and (41) into (40), we have

$$ \gamma(q, \ell) = \frac{\lambda^2 R^2}{A\,\mathcal{B}(0)}\,B\!\left(\frac{q\lambda R}{a}, \frac{\ell\lambda R}{b}\right) \qquad (45) $$

Therefore, the spatial eigenvalue associated with each of the two-dimensional
harmonic spatial frequencies $(q/a, \ell/b)$ is given by the evaluation of the
source radiance function $B(u_1, u_2)$ at the point $(u_1 = q a_0,\ u_2 = \ell b_0)$ where
$a_0 = \lambda R/a$, $b_0 = \lambda R/b$. Thus, a significant spatial frequency harmonic, or
spatial mode, will exist for all $(q, \ell)$ combinations such that (45) is
non-negligible. Note the above is equivalent to effectively sampling $B(\mathbf{u})$ at
points separated by $a_0 = \lambda R/a$ in the $u_1$ dimension and by $b_0 = \lambda R/b$ in
the $u_2$ dimension. This means that if $s_0$ is the two-dimensional spatial
area over which the transform of $\mathcal{B}(\mathbf{r})$ is non-negligible, then the number of
significant spatial modes (harmonics) can be determined by partitioning $s_0$
into grids, or disjoint rectangular cells of sides $a_0$ and $b_0$, and determining the
number of such cells needed to cover $s_0$. This means

$$ [\text{number of spatial modes}] \approx \frac{\text{area } s_0}{a_0 b_0} + 1 \qquad (46) $$

where the added one is needed to include the case where $s_0 \ll a_0 b_0$. Note
that the area $a_0 b_0$, when projected to the source at distance $R$, subtends at
the detector a solid angle $a_0 b_0/R^2 = (\lambda R/a)(\lambda R/b)/R^2 = \lambda^2/ab = \lambda^2/A$, which
is the diffraction limited field of view of the detector. Therefore, the
number of significant spatial modes over a rectangular detector surface can
alternatively be viewed as the number of solid angles $\lambda^2/A$ needed to cover
the solid angle subtended by the radiance function $B(\mathbf{u})$ located at the source.
Thus, we can also write

$$ \text{number of spatial modes} \approx \frac{\text{area } s_0/R^2}{\lambda^2/A} + 1 = \frac{(A)(\text{area } s_0)}{\lambda^2 R^2} + 1 \qquad (47) $$

Note that the above interpretation effectively replaces the true source by a
fictitious source whose spatial area corresponds to the area of the radiance
function $B(\mathbf{u})$.
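
The mode count in (47) reduces to a ratio of two solid angles, which is simple to evaluate. The following is a minimal sketch with hypothetical numbers (a 1 cm^2 detector, a 1 micrometer wavelength, and a 1 m^2 effective source area at 1 km), chosen only for illustration; the function names are not from the report.

```python
def diffraction_limited_solid_angle(wavelength, detector_area):
    """Solid angle lambda^2 / A of the diffraction limited field of view."""
    return wavelength ** 2 / detector_area

def spatial_mode_count(source_area, distance, wavelength, detector_area):
    """Approximate number of spatial modes as in (47): the source solid angle
    divided by the diffraction limited solid angle, plus one."""
    source_solid_angle = source_area / distance ** 2
    return (source_solid_angle
            / diffraction_limited_solid_angle(wavelength, detector_area) + 1.0)

# Hypothetical geometry, SI units throughout.
modes = spatial_mode_count(source_area=1.0, distance=1.0e3,
                           wavelength=1.0e-6, detector_area=1.0e-4)
point_source_modes = spatial_mode_count(source_area=0.0, distance=1.0e3,
                                        wavelength=1.0e-6, detector_area=1.0e-4)
print(f"extended source: about {modes:.1f} spatial modes")
print(f"point source   : {point_source_modes:.1f} spatial mode")
```

When the source area shrinks toward zero the count tends to one, which recovers the point detector case of a single spatial mode.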


B. Circular Detectors 

Now consider the case where a circular detector of radius $a$ is
used, and assume a coherence-separable field as in part (A) having a coherence
and radiance function possessing circular symmetry. That is, in (38)

$$ \mathcal{B}(\mathbf{r}_2 - \mathbf{r}_1) = \mathcal{B}(|\mathbf{r}_2 - \mathbf{r}_1|), \qquad B(\mathbf{u}) = B(|\mathbf{u}|) \qquad (48) $$

When the detector radius $a$ is larger than the width of the coherence function,
an approximate solution to (40) is given by

$$ h_{km}(\mathbf{r}) = C_{km}\,J_m(b_{km} r) \cos m\theta_r \qquad (49) $$

where $C_{km}$ are normalizing constants, $\{b_{km} a\}$ are the zeros of the Bessel
function $J_m(x)$, and $\theta_r$ is the angle of the point at distance $r$. Thus, (49)
is the circular equivalent to (41). To determine eigenvalues we substitute
back into the integral equation in (40). The right hand side becomes

$$ [A\,\mathcal{B}(0)]^{-1} \int \mathcal{B}(\mathbf{r} - \mathbf{s})\,h_{km}(\mathbf{s})\,d\mathbf{s} = [A\,\mathcal{B}(0)]^{-1} \int \mathcal{B}(\mathbf{r} - \mathbf{s})\,C_{km}\,J_m(b_{km} s) \cos(m\theta_s)\,d\mathbf{s} \qquad (50) $$

where $[A\,\mathcal{B}(0)]^{-1}$ is absorbed into $C_{km}$. Using the transform identity in (38)
in polar coordinates yields


B(r-s) = J J 0 (p lr-s. 1 ^ dp/ 2 -rr 


r 

X (2 ' V pB (s £ ) J q (pr)J q (os) cos m (V e s )dpde s 

q=0 0 


(51) 


and integrating out $\theta_s$, allows us to rewrite (50) as

$$ C_{km} \cos m\theta_r \int \rho\,B\!\left(\frac{\lambda R \rho}{2\pi}\right) J_m(\rho r)\,F_{km}(\rho)\,d\rho \qquad (52) $$

with

$$ F_{km}(\rho) = \int_0^a s\,J_m(\rho s)\,J_m(b_{km} s)\,ds \qquad (53) $$

The term in (53) is approximately $\delta(\rho - b_{km})/\rho$, so that (52) is

$$ C_{km} \cos(m\theta_r)\,J_m(b_{km} r)\,B(\lambda R\,b_{km}/2\pi) \qquad (54) $$

This identifies the eigenvalue in (40) as

$$ \gamma_{km} = B(\lambda R\,b_{km}/2\pi) = B(\lambda R\,x_{km}/2\pi a) \qquad (55) $$

where $x_{km}$ is the $k$th zero of $J_m(x)$.

Thus, the eigenvalues are obtained by sampling the circular radiance
function at distance $x_{km} \lambda R/2\pi a$. The number of significant spatial modes could
be identified similar to the previous example if the radiance area is taken as
$\pi a_0^2$. Then the angle subtended by it at the source, when viewed from the
detector, is $\pi a_0^2/R^2$. Since each resolution area in (55) occupies a solid angle
of $\lambda^2/\pi a^2$, the number of distinguishable modes is given by

$$ \text{number of modes} = \frac{\pi a_0^2/R^2}{\lambda^2/\pi a^2} + 1 = \frac{(\text{area radiance})(\text{area detector})}{\lambda^2 R^2} + 1 $$

The result is the same as (47), with the rectangular areas replaced by the
equivalent circular areas.




AN EXAMPLE 


WHITE BANDLIMITED, RADIANCE LIMITED, GAUSSIAN FIELDS 


The results of the previous section can now be applied to the most
practical example - that in which the coherence-separable Gaussian field can
be considered to have a flat power spectrum over a bandwidth $B$, and a flat
radiance function over a spatial area $a_0$ (projected normal to the detector
line of sight). Let the detector itself have area $A$ and observation time
$T$, located $R$ units from the source. For this case, the number of temporal
modes is known to be $2BT + 1$, where all modes have identical eigenvalues
$\gamma_t$. The number of spatial modes is $M + 1$, where $M = A a_0/\lambda^2 R^2$, and each has
identical spatial eigenvalues $\gamma_s$ (since samples of the radiance function are
now all equal). This means the observed field, over a detector area $A$ and
observation time $T$, has a total of $(2BT + 1)(M + 1)$ independent modes, each
of eigenvalue $\gamma_s \gamma_t$. The resulting characteristic function in (27) is a
product of a finite number of identical functions, and has the form

$$ \phi_m(\omega) = \left[\frac{1}{1 - \beta\gamma\omega}\right]^N \exp\left[\frac{\beta E \omega}{1 - \beta\gamma\omega}\right] \qquad (57) $$

where

$$ \gamma = \gamma_s \gamma_t, \qquad N = (2BT + 1)(M + 1) $$

Substitution into (26) can now be easily transformed, yielding

$$ p(k, V) = \frac{(\beta\gamma)^k}{(1 + \beta\gamma)^{k+N}} \exp\left[-\frac{\beta E}{1 + \beta\gamma}\right] L_k^{N-1}\left[-\frac{E}{\gamma (1 + \beta\gamma)}\right] \qquad (58) $$

where $E$ is the total signal energy over all $N$ modes and $p(k, V)$ is the
probability of $k$ counts occurring over the volume $A \times T$. Note that the
resulting density depends only upon the total signal energy $E$, the individual
eigenvalues $\gamma$, and the total number of modes $N$. Thus, the primary result of
extending the point detector to an extended spatial detector is to increase the
effective number of modes over the given observation time $T$.
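
The effect of the mode count N on the count density can be seen in the signal-free case. The sketch below is illustrative and rests on stated assumptions: it takes E = 0, so that every mode contributes an independent Bose-Einstein count with the same value of beta*gamma, and it uses the standard result that the sum of N such counts is negative binomial. The values B*T = 1, M = 1, and beta*gamma = 0.8 are hypothetical. The code builds the N-mode density by repeated convolution and compares it with the closed form.

```python
import math

def bose_einstein_pmf(k, bg):
    """Count probability of one signal-free mode with beta*gamma = bg."""
    return (bg / (1.0 + bg)) ** k / (1.0 + bg)

def convolve_truncated(p, q):
    """Density of the sum of two independent counts, kept up to len(p) - 1."""
    return [sum(p[j] * q[k - j] for j in range(k + 1)) for k in range(len(p))]

def negative_binomial_pmf(k, n_modes, bg):
    """Closed form for the sum of n_modes identical Bose-Einstein counts."""
    return (math.comb(k + n_modes - 1, k)
            * (bg / (1.0 + bg)) ** k / (1.0 + bg) ** n_modes)

# Hypothetical example values: bandwidth, observation time, spatial factor M.
bandwidth, obs_time, m_spatial = 1.0, 1.0, 1.0
n_modes = round((2 * bandwidth * obs_time + 1) * (m_spatial + 1))   # N = (2BT+1)(M+1)
bg, k_max = 0.8, 60

single = [bose_einstein_pmf(k, bg) for k in range(k_max + 1)]
by_convolution = single
for _ in range(n_modes - 1):
    by_convolution = convolve_truncated(by_convolution, single)

closed_form = [negative_binomial_pmf(k, n_modes, bg) for k in range(k_max + 1)]
max_err = max(abs(a - b) for a, b in zip(by_convolution, closed_form))
mean_count = sum(k * p for k, p in enumerate(closed_form))
print(f"N = {n_modes} modes, largest difference = {max_err:.2e}")
print(f"mean count = {mean_count:.6f} (N * beta * gamma = {n_modes * bg:.6f})")
```

Increasing either the time-bandwidth product or the spatial factor M raises N, which spreads the noise count over more independent modes.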




References


[1] S. Karp, E. O'Neill, and R. Gagliardi, "Communication Theory
    for the Free Space Optical Channel", Proc. IEEE, Vol. 58,
    p. 1611, October 1970.

[2] S. Karp and J. Clark, "Photon Counting: A Problem in Noise
    Theory", Trans. on Information Theory, Vol. IT-16, No. 6,
    p. 672, November 1970.






M-ary Poisson Detection and Optical Communications

ROBERT M. GAGLIARDI, MEMBER, IEEE, and SHERMAN KARP, MEMBER, IEEE


REFERENCE: Gagliardi, R. M., and Karp, S. (formerly with the University
of Southern California): M-ARY POISSON DETECTION AND OPTICAL
COMMUNICATIONS, University of Southern California, Los Angeles,
Calif. 90007, and NASA Electronics Research Center, Cambridge,
Mass. 02138. Rec'd 2/19/68; revised 9/3/68 and 12/20/68. Paper
69TP6-COM, approved by the IEEE Communication Theory Committee for
publication without oral presentation. IEEE TRANS. ON COMMUNICATION
TECHNOLOGY, 17-2, April 1969, pp. 208-216.

ABSTRACT: This paper presents an investigation of the problem of
maximum likelihood detection of one of M Poisson processes in a
background of additive Poisson noise. When the observables correspond
to counts of emitted photoelectrons, the problem models a
discrete version of a coherent M-ary optical communication system
using photon counters in the presence of background radiation.
Consideration is given to an average distance and a detection
probability criterion. The advantages of an M-ary pulsed intensity
set (Poisson intensities wholly concentrated in a single counting
interval) are demonstrated. The performance of such intensity sets
is exhibited in terms of error probabilities, pulse widths,
signal-to-noise ratio, and channel capacity. Behavior as a function of
the number M of intensities is also discussed. By appropriate conversion these
results may be used for determining power requirements in an optical
pulse position modulation system.


I. Introduction 

THE APPLICATION of detection theory to optical
communications has been a subject of increasing
interest. Since the output of a photodetecting surface is
often modeled as sequences of electron "counts," and
since optical photoelectrons have been generally accepted
as obeying Poisson statistics, the analysis problem is
basically one of signal detection involving Poisson processes.
The problem was first formulated in this context
by Reiffen and Sherman [1], and further contributions
were made by Abend [2], Kailath [3], and Helstrom
[4]. In this paper we investigate the general problem
of M-ary detection based upon observations of events
described by a time discrete Poisson process. Though the
formulation of the problem is of a general nature, the
principal application is to optical communications, and
the practical limits of such a system will govern the
mathematical assumptions imposed. Consideration is
given to the divergence criterion for detection and to a
criterion of maximization of probability of detection,
both readily accepted as suitable design objectives. The
intensity set yielding optimal performance on the basis of
special cases of these criteria is shown to be a special type
of orthogonal intensity set, composed of M disjoint
intensities wholly concentrated in a counting interval.
Previously, the superiority of this type of signal set in
binary detection had been shown by Abend [2] using a
signal-to-noise ratio criterion, and by Kailath [3] using
a distance criterion. This paper represents an extension
of these results to M-ary Poisson detection.

The formulation of the problem follows that of Reiffen
and Sherman [1]. The occurrence of events over an
observed interval $\Delta T$ is said to obey a Poisson process if
the probability of exactly k (an integer) events occurring
is given by

$$p(k) = \frac{(n\Delta T)^k}{k!}\, e^{-n\Delta T}. \tag{1}$$

The parameter n is the average rate of occurrence and is
called the intensity of the process. The average number
of events occurring is then $n\Delta T$ and is often called the
level of the process. If the events occur over a sequence
of intervals $\Delta T$ in which the intensity may vary from one
interval to the next, but is constant over each interval,
we have a discrete time-varying Poisson process. In
photodetection, each event corresponds to the emission
of an electron, which occurs upon arrival of a photon,
each photon having a fixed energy. The level is therefore
proportional to the average energy received per interval,
while the intensity n is proportional to the average power
(see Section V). Thus, constraints upon level and intensity
in Poisson processes are equivalent to energy and
power constraints on the incident radiation.
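As a numerical illustration of (1), the following minimal sketch evaluates the count probabilities for one interval and checks that they sum to one and that their mean equals the level. The function name `poisson_pmf` and the numbers used are hypothetical, chosen only for illustration.

```python
import math

def poisson_pmf(k: int, n: float, dt: float) -> float:
    """Probability of exactly k events in an interval dt at intensity n, as in (1)."""
    level = n * dt  # average number of events, the "level" of the process
    return level ** k * math.exp(-level) / math.factorial(k)

# Hypothetical numbers: 2e6 events/s observed for 1 microsecond gives a level of 2.
probs = [poisson_pmf(k, 2e6, 1e-6) for k in range(60)]
total = sum(probs)                               # should be very close to 1
mean = sum(k * p for k, p in enumerate(probs))   # should be very close to the level
```

Doubling either the intensity or the interval doubles the level, which is the sense in which level tracks energy and intensity tracks power.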

In optical pulse-code modulation (PCM) communications,
information is transmitted, as shown in Fig. 1(a),
by sending an optical signal intensity modulated with one
of a set of possible intensities. The modulated signal is
corrupted by background radiation of fixed intensity
during reception, resulting in a process whose intensity
is the sum of both intensities. The output of the photodetecting
surface at the receiver is then a time-varying
Poisson process of electron counts having the received
intensity. In an M-ary system, the transmitter selects
one of a set of M intensities for the optical process. The
receiver, after photodetection, counts the number of
electrons in each of M intervals $\Delta T$ seconds long and
attempts to decide, by maximum likelihood detection, which of the M
intensities is controlling the observed process. We shall
assume $\Delta T$ is suitably shorter than the inverse bandwidth
of the intensities so that the intensity remains
approximately constant over $\Delta T$. In addition, we assume
that the counting interval is exactly known at the receiver
by a perfect synchronization link. Thus, the above
system can be modeled by the block diagram in Fig. 1(b).
The input signal corresponds to a discrete Poisson
process, while the interference appears as additive
Poisson noise. (Recall that the sum of independent
Poisson processes is itself a Poisson process having an
intensity equal to the sum of the intensities.) The model
is idealized since other sources of interference, such as
thermal noise and dark currents, are neglected.

Fig. 1. A PCM optical communications receiver and its
equivalent model. (a) Receiver. (b) Equivalent model: discrete Poisson
processes with additive Poisson noise, followed by a maximum
likelihood detector that outputs the decision.

With this model the M-ary Poisson detection problem can be
formulated as follows. Let a sequence of events obeying
a discrete Poisson process occur over a sequence of M
disjoint intervals $\Delta T$, where $M\Delta T = T$, and the count
over each interval is independent of all others. Let the
observed process be controlled by one of M possible¹
intensity vectors $n_q + n_0$ for q = 1, 2, ..., M, where

$$n_q = \{n_{q1}, n_{q2}, \ldots, n_{qM}\}, \qquad n_0 = \{n_0, n_0, \ldots, n_0\} \tag{2}$$

for $n_{qi}, n_0 \ge 0$. The nonnegative $n_{qi}$ is thus the intensity
of $n_q$ during the ith interval. Under a fixed energy constraint
for each signal, we require

$$\sum_{i=1}^{M} n_{qi}\,\Delta T = N \quad \text{for all } q. \tag{3}$$

The intensity vector $n_0$ represents background noise of
constant intensity superimposed upon the desired intensity.
Let the corresponding number of events occurring
in the ith interval be $k_i$. The problem then is to determine
which of the possible intensity vectors $n_q$ is
controlling the received Poisson process by observing the
sequence of independent counts $k = \{k_1, k_2, k_3, \ldots, k_M\}$.
Under a maximum likelihood detection criterion and a
priori equilikely intensities, it is well known [1] that the
optimal test is to form the likelihood functions

$$\Lambda_q(k) = \sum_{i=1}^{M} \alpha_{qi}\, k_i \tag{4}$$

where

$$\alpha_{qi} = \log\left[1 + \frac{n_{qi}}{n_0}\right] \tag{5}$$

and select $n_q$ as the true intensity if no other likelihood
function exceeds $\Lambda_q(k)$. If a likelihood draw occurs
(more than one $\Lambda_q(k)$ is maximum), it is known that any
randomized choice among the maxima can be used. In the
following, we shall use a purely random selection in the
case of likelihood draws. Equation (4) can be interpreted
as a cross correlation of k with the $\alpha_{qi}$, an operation
readily performed by a digital cross correlator [1].

¹ In the statement of the problem, we assume M signals over
M counting intervals. Subsequent discussion with the divergence
criterion disproves the need for more than M intervals. The
problem of designing M signals over fewer than M intervals is
not considered here.
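The test of (4) and (5), including the purely random selection in the case of likelihood draws, can be sketched as follows. This is a minimal illustration, not the cross correlator of [1]; the function name `ml_detect`, the use of exact floating-point equality to detect draws, and the example numbers are assumptions made here.

```python
import math
import random

def ml_detect(counts, intensity_set, n0, rng=random):
    """Return the index q that maximizes sum_i k_i * log(1 + n_qi / n0).

    Likelihood draws are resolved by a purely random choice among the maxima.
    Exact equality is used to detect draws, which is adequate for this sketch.
    """
    scores = []
    for n_q in intensity_set:
        alpha = [math.log(1.0 + n_qi / n0) for n_qi in n_q]       # weights of (5)
        scores.append(sum(a * k for a, k in zip(alpha, counts)))  # correlation of (4)
    best = max(scores)
    winners = [q for q, s in enumerate(scores) if s == best]
    return rng.choice(winners)

# Hypothetical 3-ary pulsed set with n0 = 1: the largest count is in interval 1.
pulsed = [[5.0, 0.0, 0.0], [0.0, 5.0, 0.0], [0.0, 0.0, 5.0]]
decision = ml_detect([1, 4, 2], pulsed, n0=1.0)
```

For a pulsed set the weights are nonzero in a single interval, so the test reduces to picking the interval with the largest count.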


II. Divergence of Detection Test 


The divergence between two intensities $n_j$ and $n_q$ of
the above test is defined as

$$D_{jq} = E_{k/j}(\Lambda_{jq}) - E_{k/q}(\Lambda_{jq}) \tag{6}$$

where

$$\Lambda_{jq} = \Lambda_j(k) - \Lambda_q(k)$$

and $E_{k/j}(\Lambda)$ is the conditional average of $\Lambda$ with respect
to k given the intensity $n_j$. Abend [2] has shown that for
M = 2 (binary detection) and the condition $n_2 = 0$, the
divergence, normalized by the variance of $\Lambda$, is maximized
by a "pulsed" type of intensity, where the level
of the process is wholly concentrated in a single counting
interval. Kailath [3] has extended this result by showing
that, under a total energy constraint, other suitable
forms of "distance" are maximized by similar pulsed
intensities. We extend these notions here to the M-ary
case and the equal energy constraint of (3).

The average divergence of an M-ary intensity set
$\{n_q\}$ will be defined as

< 7) 


Since $E_{k/j}(k_i) = (n_{ji} + n_0)\Delta T$, the average divergence
becomes


_ AT ( r n u ~ I 

D = — E £ £ («>,■ ~ n qi ) hog 1 + — 

M* y , . I L noJ 

- ,og [ 1 + 3 } 

2 AT r / n u AT\ 

"i F?r p" log ( 1 + Tr) 




( 8 ) 


where $K = n_0\Delta T$. The nonnegativeness of the $n_{ji}$ and $n_0$
allows us to write


Z5 < 


2A T 


/ n ii AT\ 
ZZ»„log(i+— ) 




(9) 


< 2N log 




as an upper bound under the constraint of (3). However,
the first equality holds if the second term in (8) is zero,
requiring, for j ≠ q, $n_{ji}$ to be zero for all i at which $n_{qi}$ is
nonzero. That is, the intensities must be mutually disjoint.
The second and third equalities in (9) hold if
$n_{ji}\Delta T = N$ for one i and $n_{ji} = 0$ for all other i. Thus, the
upper bound for $\bar{D}$ occurs if the intensities of the set are
disjoint and wholly concentrated in a single counting
interval. This is satisfied with the set

$$n_{qi} = \frac{N}{\Delta T}\,\delta_{qi}, \qquad q, i = 1, 2, \ldots, M \tag{10}$$

where $\delta_{qi}$ is the Kronecker delta. The above represents an
M-ary pulsed intensity set with each intensity occupying
one of M intervals. It is significant that any disjoint
intensity set, no matter how many intervals are used,
yields the bound of the first inequality of (9), but only
the pulsed intensity set of (10) yields the second bound.
Thus, of all disjoint intensity sets, only the pulsed set
maximizes $\bar{D}$, which immediately implies that only M
intervals are required for maximization with M intensities.
Last, it may be noted that with an average energy
constraint over all intensities,

$$\frac{1}{M}\sum_{j=1}^{M}\sum_{i=1}^{M} n_{ji}\,\Delta T = N \tag{11}$$

instead of (3), we have $n_{ji} \le MN/\Delta T$, and (9) becomes

$$\bar{D} \le 2N \log\left(1 + \frac{MN}{K}\right) \tag{12}$$

which exceeds that previously derived. Furthermore, the
upper bound in (12) occurs when M - 1 intensities are
zero everywhere, and one intensity is a pulsed intensity
having value $MN/\Delta T$. In binary communications, for
example, this means that an on-off binary signal is
superior to pulse position (M = 2) signaling using the
same average energy.
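The advantage of the pulsed set under the divergence criterion can be checked numerically. The sketch below uses the pairwise divergence that follows from (6) when the mean count in interval i is $(n_i + n_0)\Delta T$, namely $D_{jq} = \Delta T \sum_i (n_{ji} - n_{qi})[\log(1 + n_{ji}/n_0) - \log(1 + n_{qi}/n_0)]$. Averaging over ordered pairs with j ≠ q is a normalization assumed here for illustration, and the function names and example sets are hypothetical.

```python
import math

def divergence(n_j, n_q, n0, dt):
    """Pairwise divergence D_jq from (6), using E[k_i] = (n_i + n0) * dt."""
    return dt * sum(
        (a - b) * (math.log(1 + a / n0) - math.log(1 + b / n0))
        for a, b in zip(n_j, n_q)
    )

def average_divergence(intensity_set, n0, dt):
    """Average of D_jq over ordered pairs j != q (normalization assumed here)."""
    m = len(intensity_set)
    total = sum(
        divergence(intensity_set[j], intensity_set[q], n0, dt)
        for j in range(m) for q in range(m) if j != q
    )
    return total / (m * (m - 1))

dt, n0, N = 1.0, 2.0, 6.0      # noise level K = n0 * dt = 2
K = n0 * dt
pulsed = [[N / dt, 0, 0], [0, N / dt, 0], [0, 0, N / dt]]
spread = [[4, 2, 0], [0, 4, 2], [2, 0, 4]]   # same energy N per signal, not pulsed
d_pulsed = average_divergence(pulsed, n0, dt)
d_spread = average_divergence(spread, n0, dt)
bound = 2 * N * math.log(1 + N / K)
```

With this normalization the pulsed set meets the value $2N\log(1 + N/K)$, while the overlapping set with the same energy per signal falls well below it.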

III. Detection Probability 

The optimality of the M-ary pulsed intensity set has
been shown, based on a divergence criterion. In this
section we show that in certain cases this superiority
also extends over a criterion based on maximization of the
detection probability. We first require an expression for
the detection probability for a general intensity set $\{n_q\}$.
Usually this is obtained by first writing the conditional
probability density of $\Lambda_q(k)$, then integrating over
regions of correct decision. However, $\Lambda_q(k)$ in (4) is a
weighted sum of independent Poisson variates which in
general is not a Poisson variable. Rather, the true
density involves an M-fold convolution of modified
Poisson densities, yielding a result that is difficult to
integrate. We shall instead use an alternative expression
for the detection probability, derived in the Appendix,
having the form

$$P_D = \frac{e^{-N}}{M} \sum_{R^M} \max_q \{\Psi(q,j)\} \tag{13}$$

where N is the intensity energy constraint of (3), $R^M$ is
the space of all M-dimensional vectors j having nonnegative
integer components, and

$$\Psi(q,j) = \prod_{i=1}^{M} \frac{[(n_{qi} + n_0)\Delta T]^{j_i}}{j_i!}\, e^{-K}. \tag{14}$$

The derivation of (13) follows an analogous procedure
used in Gaussian channels (see [5]), but is somewhat
complicated by the fact that likelihood draws occur with
nonzero probability.

We would like to determine the intensity set $\{n_q\}$ for
which $P_D$ is maximum. This has been obtained for two
particular cases of interest.


Case I: M = 2 and Symmetric Intensity Sets

Let M = 2 and consider the set of all possible symmetric
intensity sets; i.e., if $n_1 = \{a, b\}$, then $n_2 = \{b, a\}$.
For this case it is easy to show that for any intensity set
of this type, the set of vectors j for which $\Psi(1,j) \ge \Psi(2,j)$ when
a > b is simply the set $j = \{j_1, j_2\}$ such that $j_1 \ge j_2$. Using
the constraint of (3) and letting $n_1 = \{a, N - a\}$ and
$n_2 = \{N - a, a\}$, for N/2 < a ≤ N, the detection probability is


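$$P_D = \frac{e^{-(N+2K)}}{2}\left[\sum_{j_1=0}^{\infty}\sum_{j_2=0}^{j_1} \frac{(a+K)^{j_1}(N-a+K)^{j_2}}{j_1!\,j_2!} + \sum_{j_1=0}^{\infty}\sum_{j_2=j_1+1}^{\infty} \frac{(N-a+K)^{j_1}(a+K)^{j_2}}{j_1!\,j_2!}\right] \tag{15}$$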


where again K = n 0 AT. Differentiating with respect to a 
yields 


dP D 

£—(.n+k) / oo ii— 1 4/1-1 

B A 

Ah fin-1 

da 

2 

\>,-0 >2-0 (jl — 1)1 

Js! 

Jl! (/* — 1) ! 


00 

+ £ 
>1-0 

* fin A n ~ l 

A» 

fin-1 j 


£n]il iJi - 1)! 

J2! 

Oi - 1)!/ 


p-UV+K) 


/ 30 n 

{i + £ £ 

\ > 1—1 > 2 — > 1—1 


>1 A h B n + B Jl A A ) 


j i!js! 


where A = (a + K) and B = (N - a + K). Since A and B
are positive, the above substantiates $P_D$ as a monotone
increasing function of a. Therefore $P_D$ is maximized with
a having its maximum value a = N, corresponding to the
pulsed intensity set of (10) with M = 2.
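The monotone behavior in Case I can be examined by evaluating (13) and (14) directly. The sketch below sums over all count vectors up to a truncation limit `jmax`; the function name, the truncation, and the values N = 4 and K = 1 are assumptions for illustration only.

```python
import math
from itertools import product

def detection_probability(levels, K, jmax=60):
    """Evaluate (13) by direct summation, truncating every count at jmax.

    levels[q][i] is the signal level n_qi * dt; each row must sum to N.
    """
    M = len(levels)
    N = sum(levels[0])
    total = 0.0
    for j in product(range(jmax + 1), repeat=M):
        best = 0.0
        for row in levels:
            psi = 1.0
            for lvl, ji in zip(row, j):
                # one factor of (14) per counting interval
                psi *= (lvl + K) ** ji * math.exp(-K) / math.factorial(ji)
            best = max(best, psi)
        total += best
    return math.exp(-N) * total / M

N, K = 4.0, 1.0
pd_by_a = {
    a: detection_probability([[a, N - a], [N - a, a]], K)
    for a in (2.0, 3.0, 4.0)
}
```

At a = N/2 the two intensities coincide and the detector can only guess, so the value is one half; the value then grows as a moves toward N, the pulsed case.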


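Case II: Any M, N/K → 0

The limit above implies a high background noise level
situation. We observe here that

$$P_D = \frac{e^{-N}}{M}\sum_{R^M}\max_q \Psi(q,j) = \frac{e^{-N}}{M}\sum_{R^M} C \max_q \exp\left[\sum_{i=1}^{M} j_i \log\left(1 + \frac{n_{qi}\Delta T}{K}\right)\right] \xrightarrow[N/K \to 0]{} \frac{e^{-N}}{M}\sum_{R^M} C \max_q \exp\left[\sum_{i=1}^{M} \frac{j_i\, n_{qi}\Delta T}{K}\right] \tag{16}$$

where

$$C = \prod_{i=1}^{M} \frac{K^{j_i}}{j_i!}\, e^{-K}$$

and the limit follows since $n_{qi}\Delta T/K \le N/K \to 0$.
Now, with the constraint of (3),

$$\sum_{i=1}^{M} \frac{j_i\, n_{qi}\Delta T}{K} \le j_{\max}\frac{N}{K}$$

where $j_{\max} = \max_i \{j_i\}$. Thus

$$P_D \le \frac{e^{-N}}{M}\sum_{R^M} C \exp\left\{j_{\max}\frac{N}{K}\right\}, \qquad \frac{N}{K} \to 0. \tag{17}$$

The upper bound occurs when

$$\max_q \exp\left[\sum_{i=1}^{M} \frac{j_i\, n_{qi}\Delta T}{K}\right] = \exp\left\{j_{\max}\frac{N}{K}\right\} \tag{18}$$

which clearly is true for the pulsed intensity set of (10),
signifying asymptotic optimality for any M.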

To determine optimal intensity sets (either global or
local) in the general case, using (13), still remains a difficult
task. It has been conjectured by many (e.g., see [1]
and [3]) that the pulsed intensity set is in fact the optimal
set, but to the authors' knowledge a rigorous proof
has not been shown.


IV. Error Probabilities with Pulsed Intensity 
Sets 


In this section we investigate the performance of the
pulsed intensity set in M-ary detection by evaluating
the error probability $P_E = 1 - P_D$. This can be obtained
by using (13), but the computation can be more conveniently
handled by noting that for the pulsed intensity
set of (10) $\{\Lambda_q\}$ of (4) constitutes a set of independent
Poisson random variables. The variables $\Lambda_q$ have level
(N + K) if the qth intensity is sent, and have level K
otherwise ($K = n_0\Delta T$). Recall that if the qth intensity is
sent, a correct decision will be made with probability
1/(r + 1) if $\Lambda_q$ equals r other $\Lambda$'s, and exceeds the remaining
M - 1 - r. Therefore, upon considering all possibilities,
the conditional detection probability is

$$P_{D/q} = \frac{e^{-(N+MK)}}{M} + \sum_{r=0}^{M-1}\sum_{x=1}^{\infty} \left[\frac{(N+K)^x}{x!}e^{-(N+K)}\right] \left[\frac{K^x}{x!}e^{-K}\right]^{r} \left[\sum_{t=0}^{x-1}\frac{K^t}{t!}e^{-K}\right]^{M-1-r} \left[\frac{(M-1)!}{r!\,(M-1-r)!\,(r+1)}\right]. \tag{19}$$

The right side is independent of q and thus represents the
average detection probability. By applying the identity

$$\sum_{r=0}^{M-1} \frac{(M-1)!}{(r+1)!\,(M-1-r)!}\, A^r B^{M-1-r} = B^{M-1}\, \frac{(1 + A/B)^M - 1}{M (A/B)}$$

we can rewrite the error probability as

$$P_E(N,K,M) = 1 - P_D = 1 - \frac{e^{-(N+MK)}}{M} - \sum_{x=1}^{\infty} \left[\frac{(N+K)^x}{x!}e^{-(N+K)}\right] \left[\sum_{t=0}^{x-1}\frac{K^t}{t!}e^{-K}\right]^{M-1} \frac{(1+a)^M - 1}{Ma} \tag{20}$$

where

$$a = \frac{K^x/x!}{\sum_{t=0}^{x-1} K^t/t!}.$$

Fig. 2. Error probability versus normalized signal energy N
for M-ary communications and K = 3.

Fig. 3. Error probability versus normalized noise energy K
(average noise count per interval). N = normalized signal energy.

Fig. 4. Error probability versus normalized pulse width with
M = 2 and various operating conditions.

The parameter $P_E(N,K,M)$ has been digitally computed
for various values of N, K, and M. An exemplary plot
is shown in Fig. 2 in which $P_E(N,3,M)$ has been
plotted for various M as a function of N.
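A computation of this kind can be sketched as follows. The code evaluates the detection probability as a sum over the signal count x and the number of draws r, as in (19), rather than the closed form of (20), because the factor $[(1+a)^M - 1]/Ma$ loses precision when a is very small. The function names, the truncation at `xmax`, and the log-domain evaluation of the Poisson terms are choices made here, not taken from the paper.

```python
import math

def pois(x, mean):
    """Poisson probability of count x, evaluated in the log domain."""
    if mean == 0.0:
        return 1.0 if x == 0 else 0.0
    return math.exp(x * math.log(mean) - mean - math.lgamma(x + 1))

def error_probability(N, K, M, xmax=200):
    """Error probability of the M-ary pulsed intensity set with random draws."""
    p_correct = math.exp(-(N + M * K)) / M   # every count zero, random choice
    cdf_below = 0.0                          # P(noise count <= x - 1)
    for x in range(1, xmax + 1):
        cdf_below += pois(x - 1, K)
        p_sig = pois(x, N + K)               # signal interval counts exactly x
        p_tie = pois(x, K)                   # a noise interval also counts x
        inner = sum(
            math.comb(M - 1, r) / (r + 1) * p_tie ** r * cdf_below ** (M - 1 - r)
            for r in range(M)
        )
        p_correct += p_sig * inner
    return 1.0 - p_correct

pe_binary = error_probability(5.0, 3.0, 2)
pe_8ary = error_probability(5.0, 3.0, 8)
```

With K = 0 the result reduces to $(M-1)e^{-N}/M$, since an error can then occur only when the signal interval is empty and the random choice misses.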

It is important to note that $P_E$ depends on both the
normalized signal energy N and the normalized noise
energy K in the counting interval, and not simply on
their ratio. This fact is emphasized in Fig. 3 in which
$P_E(N,K,2)$ is plotted as a function of K for two fixed ratios
N/K. This dependence on both signal and noise energies
distinguishes the Poisson detection problem from the
analogous coherent Gaussian channel problem. Note that
the interfering noise energy K depends only upon the
background energy in the interval $\Delta T$, which is the width
of the transmitted intensity pulse. The prime advantage
of optical systems is precisely their ability to remove the
effect of background noise by making $\Delta T$ small, as has
been emphasized in previous reports [6], [7]. This fact
can be illustrated graphically, using (20), by considering
a binary Poisson channel (M = 2) sending information at
a rate 1/T bit/s. The effect of the parameter $\Delta T$ is
indicated by plotting $P_E(N, n_0T \cdot \Delta T/T, 2)$ as a function
of $\Delta T/T$, for fixed energy N and background noise
energy per bit interval $n_0T$. This is shown in Fig. 4. The
results indicate the continuous improvement obtained by
decreasing the "duty cycle" $\Delta T/T$, the ultimate limit
corresponding to $\Delta T = 0$. The improvement, of course, is
made at the expense of information bandwidth and peak
power (both inversely proportional to $\Delta T$). Surprisingly,
the improvement is quite small at low values of N, and
the increase in bandwidth may not be worth the decrease
obtained in error probability.

Fig. 5. Error probability versus normalized noise energy K for
fixed values of $S/N = N^2/(N+K)$. M = 2.

A quantity of particular interest to communication
engineers is the detected signal-to-noise ratio. This is
often defined [8] as the ratio of the square of the average
electron count with no noise to the variance of the count
when noise is present. For Poisson counting statistics
with pulsed intensities, this becomes $S/N = N^2/(N+K)$.
The behavior of $P_E$ of (20) as a function of K for fixed
S/N is illustrated in Fig. 5 for a binary system. The
results again indicate the ambiguity in using S/N as a
design criterion. The asymptotes show the wide functional
variation of $P_E$ as K increases from zero.
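Holding S/N fixed while K varies means that N must change with K, which is why a single S/N value does not fix the error probability. A minimal sketch of that relation, obtained by solving $N^2/(N+K) = S/N$ for N, is given below; the function names are hypothetical.

```python
import math

def detected_snr(N, K):
    """Detected signal-to-noise ratio N^2 / (N + K) for pulsed intensities."""
    return N ** 2 / (N + K)

def signal_count_for_snr(snr, K):
    """Positive root of N^2 - snr * N - snr * K = 0."""
    return 0.5 * (snr + math.sqrt(snr ** 2 + 4.0 * snr * K))

# The signal count needed for a detected S/N of 10 grows with the noise count K.
required = [signal_count_for_snr(10.0, K) for K in (0.0, 1.0, 10.0)]
```

Each (N, K) pair produced this way has the same S/N but, by the discussion above, generally a different error probability.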

As illustrated in Fig. 2, the error probabilities increase
as M increases. However, the use of a single set of curves
to compare various M-ary systems is misleading. An
M-ary system with $\Delta T$-second counting intervals transmits
$\log_2 M$ bits of information in $M\Delta T$ seconds. It therefore
communicates at a rate

$$R = \frac{\log_2 M}{M\Delta T} \ \text{bit/s}. \tag{21}$$


If the transmission rate is normalized for each M, $\Delta T$
must be readjusted to maintain a fixed rate $R = R_0$. The
effective noise level per counting interval is then

$$K = n_0\Delta T = \frac{n_0}{R_0}\,\frac{\log_2 M}{M} = 2K_0\,\frac{\log_2 M}{M} \tag{22}$$

Fig. 6. Error probability versus normalized signal energy N
(average number of signal counts), each M adjusted for fixed
information rate $R_0$. $K_0$ = normalized noise energy per interval $1/2R_0$.


where $K_0$ is the noise energy in an interval $1/2R_0$. Thus,
for a comparison of different M-ary systems, each with
fixed information rates, one should compare the parameter
$P_E\{N, 2K_0(\log M)/M, M\}$ for each N. If this
adjustment is made using (20), the curves of Fig. 6 are
generated, with $K_0 = 1$.
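The rate normalization of (21) and (22) amounts to two small formulas, sketched below with base-2 logarithms (an assumption consistent with rates in bit/s). The function names are hypothetical.

```python
import math

def interval_width(M, R0):
    """Counting interval that keeps the rate of (21) equal to R0 bit/s."""
    return math.log2(M) / (M * R0)

def noise_per_interval(M, K0):
    """Noise count per interval from (22); K0 is the noise count in 1 / (2 * R0)."""
    return 2.0 * K0 * math.log2(M) / M

# With K0 = 1, M = 2 and M = 4 see the same noise per interval; larger M see less.
noise_levels = {M: noise_per_interval(M, 1.0) for M in (2, 4, 16, 256)}
```

Since $K_0 = n_0/2R_0$, multiplying `interval_width(M, R0)` by $n_0 = 2K_0R_0$ reproduces `noise_per_interval(M, K0)`.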

The curve corresponding to M = ∞ is also shown, and
is determined by taking the limit of $P_E(N, 2K_0(\log M)/M, M)$
as M → ∞. This can be obtained by replacing K
in (20) by $K' = 2K_0(\log M)/M$ and noting

$$\lim_{M\to\infty} e^{-(N+MK')} = 0$$

$$\lim_{M\to\infty} \left[\sum_{t=0}^{x-1}\frac{(K')^t}{t!}e^{-K'}\right]^{M-1} \frac{(1+a)^M - 1}{Ma} = \begin{cases} 0, & x = 1 \\ 1, & x > 1. \end{cases}$$

Using the above we then have

$$\lim_{M\to\infty} P_E\left(N, 2K_0\frac{\log M}{M}, M\right) = 1 - \sum_{x=2}^{\infty}\frac{N^x e^{-N}}{x!} = 1 - (1 - Ne^{-N} - e^{-N}) = (1 + N)e^{-N} \tag{23}$$

which is plotted as M = ∞ in Fig. 6. It is noteworthy that
(23) is precisely the probability of an event count of
zero or one occurring in a noiseless counting interval of
signal energy N. This has the following interpretation.
As M → ∞, the number of intervals becomes infinite, but
the normalized noise energy per interval $K' = 2K_0(\log M)/M$
approaches zero. The probability that more than
one event will occur in any one of M - 1 independent
nonsignaling intervals having noise energy K' is given by

$$1 - [(1 + K')e^{-K'}]^{M-1}.$$

This approaches zero as M → ∞, indicating that counts of
zero or one will occur in every such interval with probability
one. Furthermore, there will be an infinite number
of intervals with a zero count and with a one count.
Therefore, as M → ∞, an error will occur (with probability
approaching one) whenever the signaling interval
has a count of zero or one, and an error will never occur
when the latter interval has a count greater than one.
Hence we have (23).
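Both quantities in this argument are simple to evaluate. The sketch below computes the limit (23) and the probability that some nonsignaling interval holds more than one noise event, using `log1p` and `expm1` to keep precision when K' is very small. Base-2 logarithms in K' and the function names are assumptions made here.

```python
import math

def limiting_error_probability(N):
    """Right side of (23): probability of a count of zero or one at level N."""
    return (1.0 + N) * math.exp(-N)

def prob_some_noise_interval_exceeds_one(M, K0):
    """1 - [(1 + K') exp(-K')]^(M - 1) with K' = 2 * K0 * log2(M) / M."""
    k_prime = 2.0 * K0 * math.log2(M) / M
    log_p_at_most_one = math.log1p(k_prime) - k_prime   # log of (1 + K') e^{-K'}
    return -math.expm1((M - 1) * log_p_at_most_one)

# With K0 = 1 the probability falls as M grows, in line with the argument above.
trend = [prob_some_noise_interval_exceeds_one(M, 1.0) for M in (16, 1024, 2 ** 20)]
```

The decay is slow, so very large M is needed before the M = ∞ curve is a close description.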

It is also interesting to note in Fig. 6 that the best
system operation, in terms of minimal error probability,
does not always correspond to M → ∞. In fact, it can be
shown that best M operation depends strongly on the
amount of background noise $K_0$. For example, if $K_0 = 0$,
it is easy to show, using (20), that for M finite,
$P_E(N, K', M) = (M-1)e^{-N}/M$, which is monotonically increasing
with M and always less than the M = ∞ value of (23).
Thus, with negligible background noise, system operation
improves with decreasing M, and is best for M = 2.
Physically, this means the noise reduction advantage
due to decreasing $\Delta T$ as M increases does not offset the
increasing errors due to the larger numbers of likelihood
draws that will occur. (Recall a random choice is made
in the event of draws.) For large amounts of background
noise, however, the converse is true, and M = ∞ does
yield minimal error probability.

It should be emphasized that a fixed energy constraint
was imposed on the signal intensity, and therefore the
time average power $P_0 = N/T = NR/\log M$ actually
approaches zero as M → ∞. If the average power level
of the source is instead fixed at some level $P_0$, N must
be replaced by $P_0 \log M/R$ in the previous equations,
and we find $P_E \to 0$ as M → ∞ for any $P_0 > 0$. This result
may be compared to a similar result for an additive
Gaussian channel [9] in which zero error probability
occurred only if $P_0$ satisfied a condition dependent on
the rate R.

The $P_E$ results above are useful for determining the
channel capacity (maximum information rate) of an
M-ary pulsed intensity set. Assume a transmitter sends one
of M pulsed intensities every T seconds, with each
pulse having width $\Delta T = T/M$. If the transmitter operates
at a fixed rate $R_0$, then again $T = (\log M)/R_0$ as
given by (21). The channel can now be represented as a
symmetric channel in which each of the M equally likely
intensities is converted to itself with probability $1 - P_E$,
and is converted to any of the other intensities with equal
probability $P_E/(M-1)$. The channel capacity for this
type of system is known to be

$$C = \frac{\log M + P_E \log\left(P_E/(M-1)\right) + (1 - P_E)\log(1 - P_E)}{\log M / R_0} \tag{24}$$


where $P_E = P_E(N, n_0 \log M/MR_0, M)$. We shall again
consider the signal intensity energy N, the background
noise power $n_0$, and the rate $R_0$ to be held fixed. Then, as
M → ∞, $P_E$ approaches the limit in (23), while the channel
capacity has the limit

$$C \to [1 - (1 + N)e^{-N}]R_0 \tag{25}$$

for N finite. The above indicates that information transfer
can be forced to approach any desired rate with a finite
signal energy by using an increasingly larger number of
intensities and adjusting $R_0$ at the transmitter. However,
each level is transmitted with a nonzero error probability,
and the information bandwidth and peak power become
infinite. Again, introduction of a transmitter power constraint,
instead of an energy constraint, will yield operation
at a capacity $R_0$ with a zero error probability as
M → ∞.
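The capacity expression (24) can be evaluated directly once $P_E$ is known. The sketch below takes $P_E$ as an input and guards the endpoints where a logarithm of zero would appear; because (24) is a ratio of logarithms, the base does not matter. The function name and the sample values are hypothetical.

```python
import math

def capacity(pe, M, R0):
    """Capacity (24) of the M-ary symmetric channel, in the units of R0."""
    info = math.log2(M)
    if pe > 0.0:
        info += pe * math.log2(pe / (M - 1))
    if pe < 1.0:
        info += (1.0 - pe) * math.log2(1.0 - pe)
    return info * R0 / math.log2(M)

error_free = capacity(0.0, 4, 1e6)        # no errors: capacity equals R0
useless = capacity(0.75, 4, 1.0)          # output independent of input
large_m = capacity(0.2, 2 ** 40, 1.0)     # close to (1 - pe) * R0 for large M
```

The last value illustrates the limit (25): for large M the capacity approaches $(1 - P_E)R_0$.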


V. Summary and Application of Results 

In this paper we have investigated M-ary Poisson
detection, defined as the maximum likelihood detection
of one of a set of M discrete Poisson processes in the
presence of an additive discrete Poisson noise process.
The model represents a discrete version of an optical
communication system in which the observables are
counts of photoelectrons, the signals are intensity modulated
continuous-wave optical sources, and the noise is
background radiation received within the optical bandwidth.
The photoelectron count can then be modeled as
a time varying Poisson process whose average rate is
proportional to the sum of the intensities of the modulated
source and the background radiation. In practical
operation, the intensity of the optical source is a continuous
process, but the analysis can be put on a discrete
basis by partitioning the signaling intervals into subintervals
over which the intensity is taken to be constant.
The above Poisson model is examined, and the
advantages of a pulsed type of intensity set are demonstrated.
The latter corresponds to an optical system
using pulse position modulation in which information is
transmitted by a burst or pulse of optical energy located
in one of a set of pulse positions. The performance of such
a system, in terms of pulse width and numbers of pulse
positions, is presented here. The results of this paper
basically represent theoretical limits which an optical
link can approach, since the deleterious effects of receiver
(thermal) noise have been neglected. This latter
assumption becomes valid, for example, when photomultipliers
are used in detection, and the background
radiation collected at the receiver is the predominant
source of noise.

The analyses and performance results are in terms of
N and K, the average electron counts due to signal and
noise, respectively. However, these results can be easily
converted to average power requirements by using the
relations

$$N = \frac{\eta P_s M}{h f B}, \qquad K = \frac{\eta P_n}{h f B}$$

where $P_s$ and $P_n$ are the average signal and background
noise power, $h = 6.62 \times 10^{-34}$ J·s, $\eta$ is the photodetector
efficiency (including photomultiplication), f is the optical
frequency of the continuous-wave source, and $B = 1/\Delta T$.
The average power $P_s$ and $P_n$ can be further converted to
transmitted power by introducing space losses and receiver
optics (e.g., see [10, chs. 1 and 2]). Exact synchronization
has been assumed here between transmitter
and receiver at all times. Besides receiver thermal noise,
the analysis has excluded the effects of photomultiplier
statistics, saturation, and dark currents. We also have
assumed constant intensity background radiation which
implies a wide-band optical filter. This assumption
restricts the minimum value of $\Delta T$ to approximately
$10^{-10}$ to $10^{-12}$ second.


Appendix

In this Appendix we derive (13) of the report. The
average probability of correctly determining the true
intensity in M-ary transmission is

$$P_D = \frac{1}{M}\sum_{q=1}^{M} P(D|q) \tag{26}$$

where $P(D|q)$ is the probability of correct detection,
given that $n_q + n_0$ is the true intensity. Now the conditional
probability of the occurrence of an observed vector
$k = j = \{j_1, j_2, \ldots, j_M\}$, given $n_q + n_0$, is

$$P(k = j \,|\, q) = \prod_{i=1}^{M} \frac{[(n_{qi} + n_0)\Delta T]^{j_i}}{j_i!}\, e^{-n_{qi}\Delta T} e^{-K} \triangleq \Psi(q,j)\, e^{-N} \tag{27}$$

where N is the energy constraint given in (3). The conditional
detection probability $P(D|q)$ is then obtained by
summing over the set of all j, such that a correct decision
is made. A correct decision will occur when the qth
intensity is used, if $\Lambda_q$ is selected as being the largest. If
no other $\Lambda_t$ exceeds $\Lambda_q$, but r of the $\Lambda_t$ equal $\Lambda_q$, the receiver
will be correct with a probability of 1/(r + 1),
assuming a purely random selection is made when likelihood
equalities occur. Now j is an M-dimensional vector
with nonnegative integer components, and we shall
denote the space of all such vectors as $R^M$. The conditional
detection probability $P(D|q)$ can therefore be
written by summing over all $j \in R^M$ leading to a correct
decision. Thus,

$$P(D|q) = \sum_{r=0}^{M-1} \frac{1}{r+1} \sum_{J_{qr}} \Psi(q,j)\, e^{-N} \tag{28}$$

where $J_{qr}$ is the set of $j \in R^M$ such that no other $\Lambda_t$
exceeds $\Lambda_q$, and r other $\Lambda_t$ equal $\Lambda_q$. If we let $I_q$ denote the
r-dimensional index set corresponding to these r $\Lambda_t$, we
can for simplicity denote $J_{qr}$ symbolically as

$$J_{qr} = \{j \in R^M : \Lambda_q = \max_k \Lambda_k = \Lambda_t,\ t \in I_q\}. \tag{29}$$

Substituting (28) into (26) yields a general expression
for the detection probability:

$$P_D = \frac{e^{-N}}{M} \sum_{q=1}^{M} \sum_{r=0}^{M-1} \sum_{J_{qr}} \frac{\Psi(q,j)}{r+1}. \tag{30}$$


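Now by examining carefully the set $J_{qr}$ we can simplify
the above. Making use of the monotonicity of the
exponential function, we can write:

$$\begin{aligned}
J_{qr} &= \{j \in R^M : \exp(\Lambda_q) = \exp(\max_k \Lambda_k) = \exp \Lambda_t,\ t \in I_q\} \\
&= \Big\{j \in R^M : \prod_{i=1}^{M} [(n_{qi} + n_0)\Delta T]^{j_i} = \max_k \prod_{i=1}^{M} [(n_{ki} + n_0)\Delta T]^{j_i} = \prod_{i=1}^{M} [(n_{ti} + n_0)\Delta T]^{j_i},\ t \in I_q\Big\} \\
&= \{j \in R^M : \Psi(q,j) = \max_k \Psi(k,j) = \Psi(t,j),\ t \in I_q\}.
\end{aligned} \tag{31}$$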

Thus $J_{qr}$ can be alternatively defined as the set of j
for which $\Psi(q,j)$ is one of r + 1 maximum $\Psi(k,j)$ functions.
This means every j in $J_{qr}$ also belongs to r other
sets $J_{tr}$, $t \in I_q$, or correspondingly, a point j in $J_{tr}$,
$t \in I_q$, exists such that $\Psi(t,j) = \Psi(q,j)$. Note that the set
of subspaces $\{J_{qr}\}$ are disjoint for different r, but not
for different q. With these facts consider the summation

$$\sum_{q=1}^{M} \sum_{J_{qr}} \frac{\Psi(q,j)}{r+1} \tag{32}$$

for fixed r. For any term of the sum, say $\Psi(q_0,j_0)/(r+1)$,
there exist r other terms having the same value, one for
each point $j_0$ of $J_{tr}$, $t \in I_{q_0}$. The total contribution to the
sum above from this set of r + 1 terms is then

$$(r+1)\left[\frac{\Psi(q_0,j_0)}{r+1}\right] = \Psi(q_0,j_0) = \max_q \Psi(q,j_0) \tag{33}$$

the last equation following since $j_0 \in J_{q_0 r}$. Thus, overlapping
points in the summation of (32) contribute a
total amount given by (33). It therefore follows that

$$\sum_{q=1}^{M} \sum_{J_{qr}} \frac{\Psi(q,j)}{r+1} = \sum_{\bigcup_q J_{qr}} \max_q \Psi(q,j) \tag{34}$$

where $\bigcup_q J_{qr}$ is the union over q of the subsets $\{J_{qr}\}$.
Inverting the order of summation in (30) and using (34)
allows us to rewrite

$$P_D = \frac{e^{-N}}{M} \sum_{r=0}^{M-1} \sum_{\bigcup_q J_{qr}} \max_q \Psi(q,j) = \frac{e^{-N}}{M} \sum_{R^M} \max_q \Psi(q,j) \tag{35}$$

where we have employed the fact that the $\bigcup_q J_{qr}$ are
disjoint subspaces, and the sum over all r spans the
whole space $R^M$. Equation (35) is the same as (13).


References 

[1] B. Reiffen and H. Sherman, "An optimum demodulator for
Poisson processes: photon source detectors," Proc. IEEE,
vol. 51, pp. 1316-1320, October 1963.

[2] K. Abend, "Optimum photon detection," IEEE Trans.
Information Theory (Correspondence), vol. IT-12, pp. 64-65,
January 1966.

[3] T. Kailath, "The divergence and Bhattacharyya distance
measures in signal selection," IEEE Trans. Communication
Technology, vol. COM-15, pp. 52-60, February 1967.

[4] C. W. Helstrom, "The detection and resolution of optical
signals," IEEE Trans. Information Theory, vol. IT-10,
pp. 275-287, October 1964.

[5] A. Viterbi, Principles of Coherent Communication. New
York: McGraw-Hill, 1966, p. 234.

[6] M. Ross, "Pulse interval modulation laser communications,"
presented at the Eastcon Convention, Washington,
D.C., October 1967.

[7] S. Karp and R. Gagliardi, "A low duty cycle optical
communication system," presented at the Eastcon Convention,
Washington, D.C., October 1967.

[8] W. Pratt, "Binary detection in an optical polarization
modulation communication channel," IEEE Trans.
Communication Technology (Concise Papers), vol. COM-14,
pp. 664-665, October 1966.

[9] A. Viterbi [5], p. 226.

[10] M. Ross, Laser Receivers. New York: Wiley, 1966.


Robert M. Gagliardi (S’57-M’61) received a degree in electrical
engineering from the University of Connecticut, Storrs, in 1956,
and the M.S. and Ph.D. degrees in engineering from Yale University,
New Haven, in 1957 and 1960, respectively.

From 1958 to 1960 he was an Instructor at the New Haven
Engineering College. In 1960 he joined the Information Study
Section, Space System Division, Hughes Aircraft Company, Culver
City, Calif., where he was involved in problems in telemetry
and communication systems. He is presently an Associate Professor
in the Department of Electrical Engineering, University of
Southern California, Los Angeles, and a Consultant to Hughes
Aircraft Company.

Dr. Gagliardi is a member of Eta Kappa Nu, Tau Beta Pi,
and Sigma Xi.


Sherman Karp (M’62) received degrees, including the M.S.E.E.,
from the Massachusetts Institute of Technology, Cambridge, in
1960 and 1962, and the Ph.D. degree in electrical engineering
from the University of Southern California, Los Angeles, in 1967.

After several years of industrial experience, he joined NASA
Electronics Research Center, Cambridge, Mass. He is presently
Section Chief in Optical Communications in the Optics Laboratory.
His main interest is in the general area of reliable communication
systems with interest in modulation and coding techniques.

Dr. Karp is a member of Tau Beta Pi, Eta Kappa Nu, and
Sigma Xi.









NASA supported under Grant 
NGL 05-018-104 


energy. For a wide range of signal intensities $|S(t)|^2$, this can be
approximated by a Poisson distribution with intensity

$$\beta |S(t)|^2 + \beta P$$

where $\beta N_0 \ll 1$ and $2BT \gg 1$. $P$ is the total average noise power
$2BN_0$. Estimation of the delay of one signal in optical communication
is discussed by Karp and Clark [1] and Bar-David [2]. In this
note we want to estimate the delay of $M$ pulse-position-modulated
(PPM) signals in Laguerre communications when the receiver does
not know which signal is present. In this $M$-ary PPM system, the
transmitter selects one of the set of $M$ intensities for the optical
process. The receiver, after photodetection, counts $K_i$, the number
of electrons in each of $(M + 1)$ intervals, $\Delta T$ seconds long, and
attempts maximum likelihood detection of which of the $M$ intensities is
controlling the observed process. On the basis of this decision, the
maximum likelihood estimate of the delay $\theta$, $0 < \theta < \Delta T$, is derived
when the signal-to-noise ratio (SNR) is very high, and it is compared
with the estimate obtained under Poisson statistics.

MAXIMUM LIKELIHOOD DETECTOR 


Reprinted by permission from 
IEEE TRANSACTIONS ON COMMUNICATIONS 
Vol. COM-22, No. 5, May 1974 

Copyright © 1974. by the Institute of Electrical and Electronics Engineers. Inc 
PRINTED IN THE U S A. 


Assuming the cost matrix $\{c_{ij}\}$ is such that $c_{ii} = 0$, $c_{ij} = 1$, $i \neq j$,
and the signals are equilikely, we decide $S_i$ if the joint distribution
of $(K_1, \ldots, K_{M+1})$ satisfies

$$P(K_1, K_2, \ldots, K_{M+1}/S_i) = \max_{1 \le j \le M} P(K_1, K_2, \ldots, K_{M+1}/S_j). \qquad (2)$$

Since our assumption is white band-limited noise, the counts in each
interval, i.e., the $K_i$ counts in the interval $((i-1)\Delta T, i\Delta T)$, are
independent:

$$P(K_1, K_2, \ldots, K_{M+1}/S_j) = \prod_{i=1}^{M+1} P(K_i/S_j).$$


Estimation of Delay of M PPM Signals in Laguerre 
Communications 


Signals are given by

$$S_i(t) = \begin{cases} \sqrt{E}, & \text{if } \Delta T(i-1) < t < i\Delta T \\ 0, & \text{elsewhere.} \end{cases}$$

Since (1) is generated by an $M$-fold convolution, the joint distribution
of $K_1, K_2, \ldots, K_{M+1}$ is given by

N. C. MOHANTY, MEMBER, IEEE

Abstract: In this note the maximum likelihood detector for $M$
pulse-position-modulated (PPM) signals in Laguerre communications
is derived. A decision-directed maximum likelihood estimator
for the delay of $M$ PPM signals is discussed.


INTRODUCTION 


The output of an idealized quantum photodetector is a doubly
stochastic Poisson process with the intensity of the process

$$\lambda(t) = \beta |S(t) + n(t)|^2$$

where $n(t)$, $t \in (0,T)$, is zero-mean white Gaussian noise band limited to
$\pm B$ Hz and $S(t)$ is a deterministic signal occupying the same band.
Karp and Clark [1] have shown that the probability that the number
of counts $N_T$ in $[0,T]$ equals $K$ is given by the Laguerre distribution, i.e.,

$$P[N_T = K] = \frac{(\beta N_0)^K}{(1 + \beta N_0)^{1+K+2BT}} \exp\left[\frac{-\beta \int_0^T |S(t)|^2\,dt}{1 + \beta N_0}\right] L_K^{2BT}\left[\frac{-\int_0^T |S(t)|^2\,dt}{N_0(1 + \beta N_0)}\right] \qquad (1)$$

where $\beta$ is a constant directly proportional to the detector quantum
efficiency and surface area and inversely proportional to photon
Paper approved by the Associate Editor for Space Communications of 
the IEEE: Communications Society for publication without oral presenta- 
tion. Manuscript received March 11)73. revised November C3. 11173. 

The author is with the Department of Electrical Engineering, State
University of New York at Buffalo, Buffalo, N.Y.


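$$P(K_1, K_2, \ldots, K_{M+1}/S_j) = a^{K_t}\, b\, L_{K_j}(-(\Delta T - \theta)c)\, L_{K_{j+1}}(-\theta c)$$

where

$$a = \frac{\beta N_0}{1 + \beta N_0}, \qquad K_t = K_1 + K_2 + \cdots + K_{M+1},$$

$$c = \frac{E}{N_0(1 + \beta N_0)}, \qquad \theta = \text{delay parameter},$$

and $b$ is a factor of the form $\frac{1}{1 + \beta N_0}\exp[\,\cdot\,/(1 + \beta N_0)]$ whose
exponent is not legible in this copy.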

Therefore, (2) becomes: if

$$L_{K_i}(-(\Delta T - \theta)c)\, L_{K_{i+1}}(-\theta c) = \max_{j} L_{K_j}(-(\Delta T - \theta)c)\, L_{K_{j+1}}(-\theta c)$$

then $S_i$ is sent. Since this maximization is true for all $\theta$, and $L_K(-x)$
is a monotonically increasing function in $x > 0$, the rule is to decide
$S_i$ if

$$\int L_{K_i}(-(\Delta T - \theta)c)\, L_{K_{i+1}}(-\theta c)\, d\theta = \max_{1 \le j \le M} \int L_{K_j}(-(\Delta T - \theta)c)\, L_{K_{j+1}}(-\theta c)\, d\theta.$$

Using identities [3], and after some algebra, decide $S_i$ if

$$L'_{K_i + K_{i+1} + 1}(-\Delta T E) = \max_{1 \le j \le M} L'_{K_j + K_{j+1} + 1}(-\Delta T E).$$

Since Laguerre polynomials are increasing, the decision criterion
becomes: decide $S_i$ if

$$K_i + K_{i+1} = \max_{j} (K_j + K_{j+1}).$$
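The pair-sum rule above lends itself to a short illustration. The following Python sketch is not from the paper: the function name `decide_ppm_signal` and the sample counts are hypothetical, and ties are broken in favor of the lowest index, a choice the source does not specify.

```python
def decide_ppm_signal(counts):
    """Pick the signal index i (1-based) that maximizes K_i + K_{i+1}.

    `counts` holds the M + 1 interval counts K_1 ... K_{M+1}.
    Ties go to the lowest index (an assumption; the paper is silent on ties).
    """
    if len(counts) < 2:
        raise ValueError("need at least two interval counts")
    # Sum of each pair of adjacent counters: K_i + K_{i+1}, i = 1 ... M.
    pair_sums = [counts[i] + counts[i + 1] for i in range(len(counts) - 1)]
    best = max(range(len(pair_sums)), key=lambda i: pair_sums[i])
    return best + 1, pair_sums


# Hypothetical counts for M = 4 signals (five intervals).
index, sums = decide_ppm_signal([1, 0, 7, 5, 2])
print("decide S_%d, pair sums %s" % (index, sums))
```

With these counts the delayed pulse straddles the third and fourth intervals, so the third pair sum is the largest.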


Fig. 1. Maximum likelihood detector of $M$ PPM signals.


CONCLUSION 

An adaptive estimator based on the decision criterion is derived on the
assumption of very high SNR. A minimum mean-square estimator
can be derived if we assume some statistics on $\theta$. Related matters
are discussed in [4]-[6].

ACKNOWLEDGMENT 

The author wishes to thank Prof. R. M. Gagliardi for helpful 
discussions. 


REFERENCES 

[1] S. Karp and J. R. Clark, "Photon counting: A problem in classical
noise theory," IEEE Trans. Inform. Theory, vol. IT-16, pp. 672-680,
Nov. 1970.

[2] I. Bar-David, "Communication under the Poisson regime," IEEE
Trans. Inform. Theory, vol. IT-15, pp. 31-37, Jan. 1969.

[3] G. Sansone, Orthogonal Functions. New York: Interscience, 1959,
ch. 4.

[4] R. Gagliardi and N. Mohanty, "Estimation theory and optical
communication," Electron. Sci. Lab., Univ. Southern California,
Los Angeles, USCEE Rep. 446, Apr. 1973.

[5] R. Gagliardi and N. Mohanty, "MAP synchronization in optical
communication systems," Electron. Sci. Lab., Univ. Southern California,
Los Angeles, USCEE Rep. 448, Apr. 1973.

[6] N. C. Mohanty, "M-ary Laguerre detection," IEEE Trans. Aerosp.
Electron. Syst. (Corresp.), vol. AES-9, pp. 464-467, May 1973.


The receiver counts the number of photons in each pair of adjacent
counters, compares the sums, and decides $S_i$ if the count is maximum in
the $i$th counting window of length $2\Delta T$. See Fig. 1. When the counting
statistics are Poisson (i.e., when the SNR is very high), it can be easily
verified that the decision criterion is the same as in the preceding statements.


MAXIMUM LIKELIHOOD ESTIMATOR 

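Based on the decision that $S_i$ is present, we will find that the
maximum likelihood estimator $\theta_L$ must satisfy

$$\frac{\partial}{\partial \theta}\left[L_{K_i}(-(\Delta T - \theta)c)\, L_{K_{i+1}}(-\theta c)\right] = 0. \qquad (3)$$

Again using Laguerre identities [3], (3) becomes

$$L'_{K_i - 1}(-(\Delta T - \theta_L)c)\, L_{K_{i+1}}(-\theta_L c) = L_{K_i}(-(\Delta T - \theta_L)c)\, L'_{K_{i+1} - 1}(-\theta_L c).$$

Assuming very high SNR,

$$L_K^{\alpha}(x) \approx \frac{(-x)^K}{K!}, \quad \text{when } x \gg 1. \qquad (4)$$

Using (3) and (4), $\theta_L$ must satisfy

$$\frac{(\Delta T - \theta_L)c}{K_i} = \frac{\theta_L c}{K_{i+1}}. \qquad (5)$$

From (5), we get

$$\theta_L = \frac{\Delta T\, K_{i+1}}{K_i + K_{i+1}}.$$

When the counting statistics are Poisson, the maximum likelihood
estimator $\theta_P$ must satisfy

$$\frac{\partial}{\partial \theta}\left[\left((\Delta T - \theta_P)E + P\right)^{K_i} \left(\theta_P E + P\right)^{K_{i+1}}\right] = 0.$$

This, on simplification, gives

$$\frac{(\Delta T - \theta_P)E + P}{K_i} = \frac{\theta_P E + P}{K_{i+1}}. \qquad (6)$$

From (6), we get

$$\theta_P = \frac{\Delta T\, E\, K_{i+1} + P(K_{i+1} - K_i)}{E(K_i + K_{i+1})}.$$

When $E/P \gg 1$, $\theta_P$ and $\theta_L$ are almost equal.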
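The two closed-form estimators can be compared numerically. The sketch below is an illustration only: the function names and the sample values of the counts, $\Delta T$, $E$, and $P$ are hypothetical, and the formulas are simply the expressions obtained from (5) and (6).

```python
def delay_estimate_laguerre(k_i, k_next, delta_t):
    """High-SNR Laguerre estimate: theta_L = dT * K_{i+1} / (K_i + K_{i+1})."""
    total = k_i + k_next
    if total == 0:
        raise ValueError("both counts are zero; the estimate is undefined")
    return delta_t * k_next / total


def delay_estimate_poisson(k_i, k_next, delta_t, energy, noise):
    """Poisson estimate from (6):
    theta_P = (dT*E*K_{i+1} + P*(K_{i+1} - K_i)) / (E*(K_i + K_{i+1}))."""
    total = k_i + k_next
    if total == 0 or energy <= 0:
        raise ValueError("need a positive total count and positive E")
    return (delta_t * energy * k_next + noise * (k_next - k_i)) / (energy * total)


# Hypothetical numbers: 30 and 10 counts in the two decided intervals.
theta_l = delay_estimate_laguerre(30, 10, 1.0)
theta_p = delay_estimate_poisson(30, 10, 1.0, energy=100.0, noise=1.0)
print(theta_l, theta_p)  # the two values approach each other as E/P grows
```

Setting `noise` to zero makes the two functions return the same value, which mirrors the remark that the estimators nearly coincide when $E/P \gg 1$.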






Reprinted by permission from IEEE TRANSACTIONS ON INFORMATION THEORY 
Vol. IT-18, No. 4, July 1972, pp. 514-515 
Copyright 1972 by The Institute of Electrical and Electronics Engineers, Inc. 
PRINTED IN THE U.S.A. 


On the Identifiability of Finite Mixtures of Laguerre 
Distributions 


N. C. MOHANTY 


Proof: 

P(N = K) = exp **0 - Z) I+l Ll (x). 


Abstract — Finite mixtures of Laguerre distribution are identifiable. 

Introduction 

Identifiability is a necessary criterion for estimating parameters 
in a mixture distribution. It is known [1 ], [2] that the classes of 
normal, exponential, Cauchy, and negative binomial distribu- 
tions are identifiable. It was established by Feller [3] that 
mixtures of Poisson distributions are identifiable. In this corre- 
spondence we show that mixtures of Laguerre distributions are 
identifiable using a theorem of Teicher [4], It is worthwhile to 
mention that the Poisson distribution is a limiting case of the 
Laguerre distribution, both of which are encountered in optical 
communication theory [5], 

Laguerre and Poisson Mixtures 


where $L_K^{\alpha}(\cdot)$ is the associated Laguerre polynomial of order $\alpha$,
$x$ is the parameter, and $|Z| < 1$.

The Laplace transform of P(N = K) [6] is denoted by 


<HD = 


(1 - Zf + 1 

exp 

(1 - eZ-"f +l 




where $A = Z/(1 - Z)$. If $x_2 > x_1$, then $P(x_1, K) < P(x_2, K)$


«;<7> * “ p h ■ ( A ■ r=^). 

Now with $S_\phi = (-\infty, +\infty)$ and $t_1 = \log Z$,

$$\lim_{t \to t_1} \frac{\phi_2(t)}{\phi_1(t)} = 0.$$


We shall use the following theorem.

Theorem [4]: Let $\mathcal{F} = \{F\}$ be a family of cumulative distributions
with transforms $\phi(t)$ defined for $t \in S_\phi$ (the domain of
definition of $\phi$) such that the mapping $M: F \to \phi$ is linear and
one to one. Suppose that there exists a total ordering ($<$) of $\mathcal{F}$
such that $F_1 < F_2$ implies i) $S_{\phi_1} \subseteq S_{\phi_2}$, ii) the existence of
some $t_1 \in \bar{S}_{\phi_1}$ ($t_1$ being independent of $\phi_2$) such that
$\lim_{t \to t_1} \phi_2(t)/\phi_1(t) = 0$. Then the class $\mathcal{H}$ of all finite
mixtures of $\mathcal{F}$ is identifiable.

Proposition: The class of all finite mixtures of Laguerre dis- 
tributions is identifiable. 

Manuscript received December 10, 1971; revised February 15, 1972. 
This work was sponsored by the National Aeronautics and Space Ad- 
ministration, under NASA Contract NGR-05-0I8-0I4. 

The author is with the Department of Electrical Engineering, University 
of Southern California, Los Angeles, Calif. 90007. 


Conclusion 

When the counting statistics are Poisson or Laguerre, finite 
mixtures of the counting statistics are identifiable. 


References 

[1] S. J. Yakowitz, "Unsupervised learning and identification of finite
mixtures," IEEE Trans. Inform. Theory, vol. IT-16, pp. 330-338, May
1970.

[2] S. J. Yakowitz and J. Spragins, "On the identifiability of finite
mixtures," Ann. Math. Statist., vol. 39, pp. 209-214, Feb. 1968.

[3] W. Feller, "On a general class of contagious distributions," Ann. Math.
Statist., vol. 14, pp. 389-433, 1943.

[4] H. Teicher, "Identifiability of finite mixtures," Ann. Math. Statist.,
vol. 34, pp. 1265-1269, 1963.

[5] S. Karp and J. R. Clark, "Photon counting: A problem in classical
noise theory," IEEE Trans. Inform. Theory, vol. IT-16, pp. 672-680,
Nov. 1970.

[6] N. C. Mohanty, "Synchronization for adaptive optical communications,"
Ph.D. dissertation, Univ. Southern California, Los Angeles,
1972.


M-ary Laguerre Detection


Abstract


In this correspondence, maximum-likelihood $M$-ary detection theory
is applied to an incoherent optical system model employing
photodetectors governed by Laguerre counting statistics. It is shown that
a maximum-likelihood Laguerre detector corresponds to a count
comparison over each signaling interval. Error probability is derived.

I. Introduction 

In a communication system using optical devices, the
receiver is modeled as a counter of electrons which are
emitted from a photodetector when a modulated optical
wave in the form of photons is incident on it. The synthesis
of the optimal receiver processing and its resulting performance
depend upon the statistics associated with this counting.
The probability of obtaining $K$ photoelectrons in the
finite time interval $(0, t)$ due to a general radiant source is
Poisson distributed according to the function


Manuscript received June 5, 1972. 

This work was sponsored under Contract NGR-05-018-104. 


IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS 


MAY 1973 



where $a$ is a constant associated with the photodetector,
$U_{R,t}$ is the random variable that represents the number of
photoelectron counts, and $I(t)$ is the intensity of the light.
When $I(t)$ is a random function of time, the sum of a
deterministic signal and a random noise, the probability
distribution is a random function of time. The required
stationary counting distribution $P(U_{R,t} = K)$, which characterizes
the detection process, is found by the statistical
average of $P[U_{R,t} | I(t)]$:

$$P[U_{R,t} = K] = \int_0^\infty \frac{1}{K!}\left[a \int_0^t I(t')\,dt'\right]^K \exp\left[-a \int_0^t I(t')\,dt'\right] P(I)\, dI \qquad (1)$$

where $P(I)$ is the probability distribution of the intensity
$I(t)$ of the optical wave incident upon the detector
[7, pp. 11-12]. In general, $P(U_{R,t} = K) = P(K)$ is a doubly
stochastic Poisson process [8]. We will consider the case
in which the intensity is the squared magnitude of the complex
envelope of the optical field; i.e.,

$$I(t) = |S(t) + n(t)|^2$$

where $S(t)$ is a deterministic signal and $n(t)$ is zero-mean
Gaussian noise band-limited to $\pm B$ Hz. With $a = 1$ and
signal bandwidth less than noise bandwidth, taking the first
$(2Bt + 1)$ eigenvalues of the covariance of $n(t)$, it can be
shown [1] that the probability distribution of $K$ electrons
is given by

$$P(K) = \frac{N^K}{(1+N)^{\alpha+K+1}} \exp\left[\frac{-E}{1+N}\right] L_K^{\alpha}\left[\frac{-E}{N(1+N)}\right] \qquad (2)$$

where $N$ is the average number of noise photon counts per
time-space mode, given by Planck's formula $N = [\exp(hf/\kappa T) - 1]^{-1}$,
where $h$ is Planck's constant, $\kappa$ is Boltzmann's
constant, $T$ is the absolute temperature, and $f$ is the optical
carrier frequency, $E$ is the energy of the signal,

$$L_K^{\alpha}(x) = \sum_{m=0}^{K} \binom{K+\alpha}{K-m} \frac{(-x)^m}{m!},$$

and $\alpha = 2Bt$ is the time-bandwidth product.
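As a sanity check on the Laguerre counting distribution (2), the sketch below evaluates it with the standard three-term recurrence for generalized Laguerre polynomials and confirms numerically that it sums to one with mean $E + (\alpha + 1)N$. The function names and parameter values are hypothetical, the mean formula is derived here from the generating function rather than quoted from the paper, and $N > 0$ is assumed.

```python
import math


def laguerre_values(k_max, alpha, x):
    """Return [L_0^alpha(x), ..., L_kmax^alpha(x)] via the recurrence
    (k+1) L_{k+1} = (2k + 1 + alpha - x) L_k - (k + alpha) L_{k-1}."""
    values = [1.0]
    if k_max >= 1:
        values.append(1.0 + alpha - x)
    for k in range(1, k_max):
        nxt = ((2 * k + 1 + alpha - x) * values[k]
               - (k + alpha) * values[k - 1]) / (k + 1)
        values.append(nxt)
    return values


def laguerre_pmf(k_max, noise, energy, alpha):
    """P(K) of (2) for K = 0 ... k_max, with noise count N > 0 per mode."""
    x = -energy / (noise * (1.0 + noise))
    lag = laguerre_values(k_max, alpha, x)
    a = noise / (1.0 + noise)
    scale = math.exp(-energy / (1.0 + noise)) / (1.0 + noise) ** (alpha + 1.0)
    return [scale * a ** k * lag[k] for k in range(k_max + 1)]


# Hypothetical parameters: N = 0.5, E = 3, alpha = 2.
pmf = laguerre_pmf(200, noise=0.5, energy=3.0, alpha=2.0)
total = sum(pmf)
mean = sum(k * p for k, p in enumerate(pmf))
print(total, mean)  # total should be close to 1, mean close to E + (alpha + 1) N
```

The recurrence is used instead of the explicit sum because every term stays positive for a negative argument, which avoids cancellation at large $K$.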


When the counting statistics are Poisson, the detection
and estimation problems are considered by Reiffen and
Sherman [2], Bar-David [3], and Gagliardi and Karp
[4]. In this correspondence, we will consider detection of
$M$ signals when the counting statistics are governed by the Laguerre
distribution. Detection schemes for two signals in this
area are considered by Helstrom [5] and Gagliardi [6].

II. M- ary Laguerre Detection 

In an $M$-ary system, the transmitter selects one of a set
of $M$ intensities for the optical process. The receiver, after
photodetection, counts the number of electrons in each of
$M$ intervals $\Delta T$ ($M\Delta T = t$) seconds long, and attempts
maximum-likelihood detection of which of $M$ intensities is
controlling the observed process. We shall assume that $\Delta T$
is suitably shorter than the inverse bandwidth of the
intensities, so that the intensity remains approximately
constant over $\Delta T$. In addition, we assume that the counting
interval is exactly known at the receiver by a perfect
synchronization link, signals are equilikely, and the cost
matrix is given by $C_{ij}$, $c_{ii} = 0$, $c_{ij} = 1$, $i \neq j$. After observing
$\mathbf{K} = (K_1, K_2, K_3, \ldots, K_M)$, where $K_i$ is the count in the
interval $[(i-1)\Delta T, i\Delta T]$ and $M\Delta T = t$, we decide $S_i$ if

$$P(\mathbf{K}|S_i) = \max_{1 \le j \le M} P(\mathbf{K}|S_j). \qquad (3)$$

Since our assumption is white noise, the $K_i$ are independent,

$$P(\mathbf{K}|S_j) = \prod_{i=1}^{M} P(K_i|S_j). \qquad (4)$$

Signals are given by

$$S_i(t) = \begin{cases} E, & \text{if } \Delta T(i-1) < t < i\Delta T \\ 0, & \text{elsewhere.} \end{cases} \qquad (5)$$

Since (2) is generated by an $M$-fold convolution,

$$P(K_i|S) = \frac{N^{K_i}}{(1+N)^{K_i+1}} \exp\left[\frac{-E_i}{1+N}\right] L_{K_i}\left[\frac{-E_i}{N(1+N)}\right]$$

where $E_i$ is the signal energy in the $i$th counting interval,
and from (5), we get

$$P(K_i|S_i) = \frac{N^{K_i}}{(1+N)^{K_i+1}} \exp\left[\frac{-E}{1+N}\right] L_{K_i}\left[\frac{-E}{N(1+N)}\right], \qquad P(K_j|S_i) = \frac{N^{K_j}}{(1+N)^{1+K_j}}, \quad j \neq i. \qquad (6)$$



Using (4), (5), and (6), 


Using [7] , we get, from (8), 



Substituting (7) into (3) and cancelling common terms, we 
get 


(E** 1 X bL Ki . x [C\ 


a (Af-2)K,~ 1\ 


$$L_{K_i}\left[\frac{-E}{N(1+N)}\right] = \max_{1 \le j \le M} L_{K_j}\left[\frac{-E}{N(1+N)}\right] \qquad (8)$$

if $S_i$ is decided. $L_m[-x]$ is a monotonic increasing function
if $x > 0$. Therefore, $K_i = \max_{1 \le j \le M} K_j$ if $S_i$ is decided. The
scheme is to make a comparison of counts, and then to pick
the maximum one. When $M = 2$, our result agrees with
Gagliardi [6].



+ ... 
+ ... 


+ 



l bL Kt [C] 



( 10 ) 


III. Error Probability 


where 


Our decision scheme suggests that, after counting
electrons in $M$ counters, we select $S_i$ if the count in the $i$th
counter is maximum. Following the procedure of Gagliardi
and Karp [4], the probability of error PE, when signal $S_i$ is
decided, is given by

$$a = \frac{N}{1+N}, \qquad b = \exp\left[\frac{-E}{1+N}\right], \qquad C = \frac{-E}{N(1+N)}.$$


PE=E E' E P ( K \\Si)P(.Kl\Si)P(KM\Si) 

Ki K m =K ( 


By using the identity

$$\sum_{K=0}^{\infty} z^K L_K(x) = (1-z)^{-1} \exp\left[\frac{xz}{z-1}\right]$$


E E E - E we get, from the (10), 

*, = 0 AT y _| =0 Ki-i=Ki K M =Kj 


X P(K 2 \Si)P(K M \S) 

/ v « M O* OO OO 

*{,) E E E E - E 


PE = 


P(K x \Si) 


1 


M 


1 + N, 


(1-a) 




exp 


-Ca M 


_ j 


1 (M\ b 2 


2 V2 


— (1 -a M ! ) ‘ exp 


-1 


(1-a) 


Ca M ~ 2 


,Af- 1 _ 


• P(K 2 \Si)P{K M \S) 


+ ••• 
+ ••• 


• (1 “«) 1 


exp 


~Ca 
a - 1 



'M\ b 3 

J/O-af- 3 


• (1 -a M 2 ) exp 


Ca"~^ 

“2—1 


(1 ~a) 



[m- i ) £ pwtwm 

A | = 0 


P(K,|S,)l. (9) 





On further simplification, we get 


PE = 


1 

1 + N 


exp 


b exp 


ca' 


.1 -a M 


\-<P 


-V 1 l( M \ b r 
L r[ r - 1 ]( X - a M-r+\) 


-rca 
a - 1 


1 

2 At-r+l -jJ ' 


This result can be computed very easily for various values of
$M$, $E$, and $N$.
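A direct numerical evaluation gives a feel for how the error probability of the maximum-count rule depends on $M$, $E$, and $N$. The sketch below does not reproduce the closed form of the correspondence. It sums the probability of a correct decision term by term, using the single-mode count distributions of (6), and it assumes that an $(r+1)$-way tie for the maximum is resolved by a fair random choice. All function names, the truncation point `k_max`, and the parameter values are hypothetical.

```python
import math


def signal_pmf(k_max, noise, energy):
    """Counts in the signaling interval, first expression of (6) (order 0)."""
    x = -energy / (noise * (1.0 + noise))
    lag = [1.0, 1.0 - x]  # L_0(x), L_1(x)
    for k in range(1, k_max):
        lag.append(((2 * k + 1 - x) * lag[k] - k * lag[k - 1]) / (k + 1))
    a = noise / (1.0 + noise)
    scale = math.exp(-energy / (1.0 + noise)) / (1.0 + noise)
    return [scale * a ** k * lag[k] for k in range(k_max + 1)]


def error_probability(m, noise, energy, k_max=400):
    """1 - P(correct) for the maximum-count rule with random tie-breaking."""
    sig = signal_pmf(k_max, noise, energy)
    p_correct = 0.0
    cdf_below = 0.0  # P(noise-only count < k)
    for k in range(k_max + 1):
        p_n = noise ** k / (1.0 + noise) ** (k + 1)  # noise-only count = k
        # r of the other m-1 counters tie at k, the rest fall below k.
        inner = sum(math.comb(m - 1, r) * p_n ** r
                    * cdf_below ** (m - 1 - r) / (r + 1)
                    for r in range(m))
        p_correct += sig[k] * inner
        cdf_below += p_n
    return 1.0 - p_correct


# Hypothetical operating points for M = 4 and N = 0.5.
for energy in (0.0, 1.0, 5.0):
    print(energy, error_probability(4, 0.5, energy))
```

With $E = 0$ the signaling interval is statistically identical to the others, so the routine returns $1 - 1/M$, which is a convenient check on the tie-handling.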


Acknowledgment 

The author is indebted to Prof. R. M. Gagliardi for
valuable discussions.


N. C. MOHANTY 

University of Southern California 

Los Angeles, Calif. 90007 


References

[1] S. Karp and J. R. Clark, “Photon counting: A problem in
classical noise theory,” IEEE Trans. Information Theory,
vol. IT-16, November 1970.

[2] B. Reiffen and H. Sherman, “On optimum demodulator for
Poisson process: Photon source detector,” Proc. IEEE,
vol. 51, pp. 1316-1320, October 1963.

[3] I. Bar-David, “Communication under the Poisson regime,”
IEEE Trans. Information Theory, vol. IT-15, January 1969.

[4] R. M. Gagliardi and S. Karp, “M-ary Poisson detection and
optical communication,” IEEE Trans. Communication Technology,
vol. COM-17, April 1969.

[5] C. W. Helstrom, “Performance of an ideal quantum receiver
of a coherent signal of random noise,” IEEE Trans. Aerospace
and Electronic Systems, pp. 562-566, May 1969.

[6] R. M. Gagliardi, “Photon counting and Laguerre detection,”
IEEE Trans. Information Theory, vol. IT-18, January 1972.

[7] W. K. Pratt, Laser Communication Systems. New York:
Wiley, 1969.

[8] D. L. Snyder, “Filtering and detection for doubly stochastic
Poisson process,” IEEE Trans. Information Theory, vol. IT-18,
pp. 91-101, January 1972.





Reprinted from the PROCEEDINGS OF THE IEEE 
VOL. 58, NO. 10. OCTOBER, 1970 

pp. 1611-1626 

Copyright © 1970 The Institute of Electrical and Electronics Engineers, Inc.

PRINTED IN THE U.S.A. 


Communication Theory for the Free- 
Space Optical Channel 

S. KARP, member, ieee, E. L. O'NEILL, and R. M. GAGLIARDI, member, ieee 


Abstract — The current understanding of quantum detectors, the 
noise mechanisms which limit (are basic to) their operation, and 
their application to optical communications (theory) is summarized. 
In this context, we are considering channels in which the electromag- 
netic field is not subjected to any propagation effects other than a 
geometric loss. (Such a channel would exist between satellites.) 
Consequently, we will concentrate on optimum time processing using 
the tools of statistical communication theory. 

Fundamental to the study of a detection process is the need to 
develop a good mathematical model to describe it [1]-[6]. Therefore, 
approximately one-fifth of the paper is devoted to establishing, in a 
semi-classical analysis, the quantum detector output electron num- 
ber as a conditional Poisson process with the conditioning variable 
being the modulus of the electromagnetic field. Once this has been 
established, these results are used to derive various limiting probabil- 
ity densities related to actual practice. Although the mathematical 
details are omitted, these results will be presented from the view- 
point of orthogonal function expansions and interpreted in terms of 
an eigenspace. 

The resulting current flow is analyzed next as a shot noise process, 
and the power density spectrum is calculated. Attention is focused 
on isolating the signal components from the noise in terms of both 
the current probability density and the power density spectrum. 
Examples are given where appropriate. At this point, an understand- 
ing of the underlying noise processes will have been presented and 
attention will shift to analog and digital communications. 

The analog communication will be presented primarily in terms of
the signal-to-noise ratio. The S/N ratio in direct detection will be
presented both as a ratio of the integrals of two separate portions of the

Manuscript received May 15. 1970. 

S. Karp was with the NASA Electronics Research Center, Cambridge, 
Mass. He is now with DOT Transportation Systems Center, Cambridge, 
Mass. 

E. L. O'Neill is with the Worcester Polytechnic Institute, Worcester, 

Mass. 

R. M. Gagliardi is with the University of Southern California, Los 
Angeles, Calif. 


spectrum and as a ratio of two moments of the probability density 
describing the current. These calculations will be extended to include 
heterodyne detection. 

Digital communications will be discussed in the context of detection
theory. It will be shown that the likelihood ratio is often a monotonic
function of the random variable representing the number of
electrons flowing. Hence optimum processing will consist of a
weighted count of electrons from various counting modes. Digital
design will be presented in terms of M-ary signaling, error probabilities,
and information rates.


I. Introduction 

WE BEGIN with a classical description for the energy
and momentum densities of a radiation field
for both the single- and multimode cases. Confining 
our treatment to the semi-classical theory, we sketch briefly 
the argument that the probability of ejecting an electron 
from a photo-cathode surface in a short time interval is 
proportional to the light intensity. From this point of view, 
we deduce an expression for the probability of releasing n 
photoelectrons in a time T in terms of a weighted Poisson 
distribution. The weight factor is the probability distribu- 
tion for the accumulated energy received on the photode- 
tective surface in equal times. 


A. Semi-Classical Theory 

1) Normal Mode Decomposition of the Field: We begin
our description of the semi-classical theory of radiation and
matter by writing down the free space wave equation for the
vector potential $\mathbf{A}(\mathbf{r}, t)$

$$\nabla^2 \mathbf{A} - \frac{1}{c^2}\frac{\partial^2 \mathbf{A}}{\partial t^2} = 0. \qquad (1)$$




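Electing to work in the Coulomb gauge, $\operatorname{div} \mathbf{A} = 0$, the
electric and magnetic field vectors are now given by

$$\mathbf{E} = -\frac{\partial \mathbf{A}}{\partial t}, \qquad \mathbf{B} = \operatorname{curl} \mathbf{A}. \qquad (2)$$

Concentrating first on a single mode of the radiation field, a
plane wave is characterized by the components of the wave
vector $\mathbf{k} = (k_x, k_y, k_z)$ where $\omega = |\mathbf{k}|c$. However, even after
specifying the direction and frequency of a plane electromagnetic
wave, there still exists the possibility of two independent,
orthogonal polarization directions, $\boldsymbol{\sigma}_1$ and $\boldsymbol{\sigma}_2$.

A plane wave, then, at frequency $\omega$ propagating in the
direction $\mathbf{k}$ can be written as:

$$\mathbf{A}(\mathbf{r},t) = \mathbf{a}(t)\exp(i\mathbf{k}\cdot\mathbf{r}) + \mathbf{a}^*(t)\exp(-i\mathbf{k}\cdot\mathbf{r}) \qquad (3a)$$

$$\mathbf{E} = i\omega\left(\mathbf{a}\exp(i\mathbf{k}\cdot\mathbf{r}) - \mathbf{a}^*\exp(-i\mathbf{k}\cdot\mathbf{r})\right) \qquad (3b)$$

$$\mathbf{B} = i\left[(\mathbf{k}\times\mathbf{a})\exp(i\mathbf{k}\cdot\mathbf{r}) - (\mathbf{k}\times\mathbf{a}^*)\exp(-i\mathbf{k}\cdot\mathbf{r})\right] \qquad (3c)$$

where

$$\mathbf{a} = (a_1\boldsymbol{\sigma}_1 + a_2\boldsymbol{\sigma}_2)\exp(-i\omega t).$$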


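It will also turn out to be useful to list the energy density $\rho$
plus the linear and angular momentum densities $\mathbf{g}$ and $\mathbf{m}$
associated with this wave.

$$\rho = \frac{\mathbf{E}\cdot\mathbf{D} + \mathbf{B}\cdot\mathbf{H}}{2} = 2\omega^2\epsilon_0|\mathbf{a}|^2$$

$$\mathbf{g} = \frac{\mathbf{E}\times\mathbf{H}}{c^2} = -\frac{1}{\mu_0 c^2}(\dot{\mathbf{A}}\times\operatorname{curl}\mathbf{A}) = \frac{2\omega^2\epsilon_0}{c}|\mathbf{a}|^2\,\hat{\mathbf{k}}$$

$$\mathbf{m} = \frac{1}{\mu_0 c^2}(\mathbf{A}\times\dot{\mathbf{A}}) = 2\omega\epsilon_0\left(|b_+|^2 - |b_-|^2\right)\hat{\mathbf{k}} \qquad (4)$$

where

$$b_\pm = \frac{1}{\sqrt{2}}(a_1 \pm i a_2).$$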

We are following here the notation of Louisell [7]. The
ambiguity in sign in the last expression is removed when we
choose either right- or left-handed circularly polarized light.
Of course, for linearly polarized light, $a_1$ and $a_2$ are in phase
so that with $|b_+|^2 = |b_-|^2$ no net angular momentum is
propagated. We also add in passing that the second term in
(3a) is added to ensure the reality of $\mathbf{A} = \mathbf{A}^*$. A plane wave
traveling in the opposite direction ($-\mathbf{k}$) is obtained by
changing the sign of $\mathbf{k}$. Finally, a standing wave is described
by taking a linear combination of the expression with
$+\mathbf{k}$ and $-\mathbf{k}$. Before moving on to the multimode description
of the radiation field, we will now select a single
polarization component of the field and decompose this
complex quantity in the form:

$$a_j = \frac{1}{\sqrt{4\epsilon_0\omega^2}}(\omega q_j + i p_j), \qquad a_j^* = \frac{1}{\sqrt{4\epsilon_0\omega^2}}(\omega q_j - i p_j). \qquad (5)$$

Under this transformation of variables, the energy and
momentum densities become

$$\rho_j = \frac{p_j^2 + \omega^2 q_j^2}{2} = H_j, \qquad \mathbf{g}_j = \frac{H_j}{c}\,\hat{\mathbf{k}} \qquad (6)$$

so that as far as energy and momentum considerations are
concerned, the radiation field can be treated as a simple
harmonic oscillator obeying Hamilton's canonical equations
of motion

$$\dot{q}_j = \frac{\partial H_j}{\partial p_j}, \qquad \dot{p}_j = -\frac{\partial H_j}{\partial q_j}. \qquad (7)$$
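The change of variables in (5) can be checked with a few lines of arithmetic: substituting $a_j$ into the energy density $2\omega^2\epsilon_0|a_j|^2$ should return the oscillator Hamiltonian of (6). The Python sketch below does this for arbitrary test values. The function names and the numbers are hypothetical and serve only as a consistency check.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity in F/m


def mode_amplitude(q, p, omega):
    """Complex amplitude a_j = (omega*q + i*p) / sqrt(4*eps0*omega^2), as in (5)."""
    return complex(omega * q, p) / math.sqrt(4.0 * EPS0 * omega ** 2)


def energy_density(amplitude, omega):
    """Energy density 2*omega^2*eps0*|a|^2 of a single plane-wave mode, as in (4)."""
    return 2.0 * omega ** 2 * EPS0 * abs(amplitude) ** 2


def oscillator_hamiltonian(q, p, omega):
    """H_j = (p^2 + omega^2 q^2) / 2, as in (6)."""
    return 0.5 * (p ** 2 + omega ** 2 * q ** 2)


# Arbitrary test values (no physical significance intended).
q, p, omega = 0.3, -1.1, 2.0
rho = energy_density(mode_amplitude(q, p, omega), omega)
print(rho, oscillator_hamiltonian(q, p, omega))
```

The permittivity cancels between the two steps, which is why the canonical variables carry no explicit dependence on $\epsilon_0$.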


Turning now to the multimode description of the field, we
impose periodic boundary conditions by introducing the
triad of integers $l_1$, $l_2$, and $l_3$ into the relation

$$\mathbf{k} = (k_x, k_y, k_z) = \frac{2\pi}{L}(l_1, l_2, l_3). \qquad (8)$$

For economy in notation, we will henceforth use the symbol
$l$ to imply this triad, and all Fourier sums will be treated as:

$$\sum_{l_1}\sum_{l_2}\sum_{l_3} \equiv \sum_{l}. \qquad (9)$$

Moreover, the orthogonality relation

$$\int_0^L\!\!\int_0^L\!\!\int_0^L \exp[i(\mathbf{k}_l - \mathbf{k}_{l'})\cdot\mathbf{r}]\,dx\,dy\,dz = V\delta_{ll'} \qquad (10)$$

taken over a cube of volume $V = L^3$ will guarantee that each
mode will contribute independently$^1$ to the total energy and
momentum of the field.

We are now in position to put all these pieces together.
Starting with the multimode description of the vector
potential

$$\mathbf{A}(\mathbf{r},t) = \sum_l\sum_\sigma \mathbf{a}_{l\sigma}\exp(i\mathbf{k}_l\cdot\mathbf{r}) + \text{complex conjugate} \qquad (11)$$

and introducing the canonical variables $q_{l\sigma}$ and $p_{l\sigma}$ through
the relation

$$a_{l\sigma} = \frac{1}{\sqrt{4\epsilon_0 V\omega_l^2}}(\omega_l q_{l\sigma} + i p_{l\sigma}) \qquad (12)$$

we may now list the expressions for the total energy and
momentum of the field in the form

$$U = \sum_l\sum_\sigma 2\epsilon_0 V\omega_l^2|a_{l\sigma}|^2 = \sum_l\sum_\sigma \frac{p_{l\sigma}^2 + \omega_l^2 q_{l\sigma}^2}{2} = \sum_l\sum_\sigma H_{l\sigma} = H$$

$$\mathbf{G} = \sum_l\sum_\sigma \frac{H_{l\sigma}}{c}\,\hat{\mathbf{k}}_l. \qquad (13)$$

$^1$ Of course, this lack of cross terms in adding up the total energy of a
system is the whole idea behind normal mode decomposition. Also,
choosing plane wave eigenfunctions, orthogonal over cubic geometry, is
merely the simplest way to proceed. Ultimately, we will work with the
mode density, in which case the size and shape of the cavity will not appear.

These equations indicate that so far as energy and momentum
are concerned, the radiation field may be considered
as a collection of oscillators, each contributing
(per mode) to the total energy and momentum. We point
out here that a quantum oscillator's level of excitation is
given by $H_{l\sigma} = n_{l\sigma}\hbar\omega_l$ and when this condition is inserted into
(13) there results the conclusion that a radiation field may
be treated as a superposition of discrete photons, per
mode, each possessing energy $\hbar\omega_l$, momentum $\hbar\omega_l/c$ and
angular momentum $\pm\hbar$.

2) Interaction Between an Atom and a Radiation Field: A
complete description of the emission and absorption of light
by an atom influenced by a radiation field is well beyond the
scope of this paper. The reader interested in the details of
the process is urged to consult [7]-[10]. We present here
only a bare outline of the approach insofar as it relates to
the photon counting distribution.

Starting with the complete Hamiltonian for a charged
particle in an electromagnetic field

$$H = \frac{(\mathbf{p} - e\mathbf{A})^2}{2m} + H_R + eV \qquad (14)$$

we neglect the term in $e^2$ and use the gauge condition
$\operatorname{div}\mathbf{A} = 0$ to reduce this to


which one can determine $|C_f(t)|^2$, the probability of finding
the combined system in the final state $|f\rangle$. Summing over
all final states, and making a number of simplifying assumptions
[8]-[10], one ends up with Fermi’s “golden rule” for
the probability per second for a transition in the form

$$\frac{2\pi}{\hbar}\,|\langle f|H_I|i\rangle|^2\,\rho(E_f). \qquad (20)$$

Here $\rho(E_f)$ is the density-in-energy of the final states, and
$\langle f|H_I|i\rangle$ represents the matrix element of the perturbation
Hamiltonian between the initial and final states. When
applied to the problem of an atom in a radiation field, one
must distinguish between the cases when only the atom and
both the atom and the radiation field are treated as quantized
systems. In the former, the semi-classical treatment,
one can correctly deduce Einstein’s B coefficient for stimulated
emission and absorption in terms of the electric dipole
moment taken between the initial and final wave functions
of the atom. On the other hand, when one also quantizes the
field including the zero point fluctuation, then (20) also
predicts the existence of Einstein's A coefficient for spontaneous
emission.

3) Photon Counting Statistics: The consequence of (20)
that is of importance to us is that it leads [8] to the result
that in a short time $\Delta t$ the probability of ejecting an electron
from an atom on the surface of a photocathode is proportional
to the incident intensity of the light $I(t)$. That is


$$H = H_A + H_R + H_I \qquad (15)$$

where $H_A = (p^2/2m) + eV$ is the Hamiltonian of the atom,
$H_R = \sum_l\sum_\sigma H_{l\sigma}$ is the Hamiltonian of the radiation field, and
$H_I = -(e/m)\mathbf{A}\cdot\mathbf{p}$ is the interaction Hamiltonian. Combining
the first two terms into the unperturbed Hamiltonian
$H_0 = H_A + H_R$, we next treat $H_I$ as a perturbation and
attempt to solve the Schrödinger equation

$$(H_0 + H_I)|\psi\rangle = i\hbar\frac{\partial|\psi\rangle}{\partial t}. \qquad (16)$$

Using the method of first order perturbation theory, we
attempt an expansion of $|\psi\rangle$ into a linear combination (with
time varying coefficients) of the eigenstates $|\psi_n^0\rangle$ of the
unperturbed Hamiltonian, known to satisfy the equation


$P_1(t, t + \Delta t) = \alpha I(t)\,\Delta t.$ (21)

For sufficiently short times $P_0(t, t + \Delta t) \cong 1 - \alpha I(t)\,\Delta t$, so that
in an interval $(0, t + \Delta t)$ there are but two ways of releasing
$n$ photo-electrons, given by

$P_n(0, t + \Delta t) = P_{n-1}(0, t)\,\alpha I(t)\,\Delta t + P_n(0, t)\,(1 - \alpha I(t)\,\Delta t).$ (22)

Subtracting $P_n(0, t)$ from both sides and dividing by $\Delta t$
before passing to the limit, we can write

$\frac{dP_n(t)}{dt} = \alpha I(t)\,P_{n-1}(t) - \alpha I(t)\,P_n(t).$ (23)

The solution to this differential-difference equation is

$P_n(t) = \frac{\left[\alpha \int_0^t I(t')\,dt'\right]^n}{n!}\,\exp\left[-\alpha \int_0^t I(t')\,dt'\right].$ (24)
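As a numerical check on the step from (23) to (24), the following sketch integrates the differential-difference equation with a simple Euler scheme and compares the result with the Poisson form. The intensity waveform, the value of `alpha`, the step count, and all function names are illustrative assumptions, not taken from the paper.

```python
import math

def poisson_pmf(n, mean):
    """Poisson form of (24): mean is alpha times the integrated intensity."""
    return mean ** n * math.exp(-mean) / math.factorial(n)

def integrate_counts(intensity, alpha, t_end, n_max=20, steps=20000):
    """Euler integration of dP_n/dt = alpha*I(t)*(P_{n-1} - P_n), with P_0(0) = 1."""
    dt = t_end / steps
    p = [1.0] + [0.0] * n_max
    mean = 0.0
    for step in range(steps):
        rate = alpha * intensity(step * dt)
        new_p = [p[0] - rate * p[0] * dt]
        for n in range(1, n_max + 1):
            new_p.append(p[n] + rate * (p[n - 1] - p[n]) * dt)
        p = new_p
        mean += rate * dt  # running value of alpha * integral of I(t')
    return p, mean

# Hypothetical intensity: a raised cosine, chosen only for illustration.
probs, mean_count = integrate_counts(lambda t: 1.0 + math.cos(t), alpha=2.0, t_end=3.0)
worst = max(abs(probs[n] - poisson_pmf(n, mean_count)) for n in range(10))
print(f"mean count = {mean_count:.4f}, largest deviation from (24) = {worst:.2e}")
```

The deviation shrinks as `steps` grows, which is consistent with (24) being the exact solution of (23).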


$H_0|\psi_n^0\rangle = i\hbar\,\frac{\partial|\psi_n^0\rangle}{\partial t}.$ (17)

In this expansion we have 

|(A> = iQOexp^- (18) 

and the probability of finding the system in a state $|\psi_n^0\rangle$ is

$|C_n(t)|^2 = |\langle\psi_n^0|\psi\rangle|^2.$ (19)

Assuming then that the combined system, atom plus radiation
field, begins in some initial state $|i\rangle$, (18) implies a set of
coupled equations for the probability amplitudes $\{C_n(t)\}$, from
which (20) follows.

Now if this process were carried out a number of times over
similarly prepared realizations of the field, the average over
this ensemble would lead to

$P_n(t, T) = \int_0^\infty \frac{(\alpha w)^n \exp(-\alpha w)}{n!}\,P(w)\,dw$ (25)

where

$w = \int_t^{t+T} I(t')\,dt'$

and $P(w)\,dw$ is the probability for $w$ to lie in the range
$(w, w + dw)$.
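To illustrate the ensemble average in (25), the sketch below evaluates the integral numerically for one assumed density: an exponential $P(w)$ with mean `mean_w`, which is the usual model for the integrated intensity of a single mode of thermal light over a short interval. That choice of density, the parameter values, and the function names are assumptions made for the example; under them the average should reproduce a Bose-Einstein count distribution.

```python
import math

def exponential_density(mean_w):
    """Assumed P(w): exponential density of the integrated intensity w."""
    return lambda w: math.exp(-w / mean_w) / mean_w

def mandel_average(n, alpha, density, w_max, steps=20000):
    """Trapezoidal evaluation of (25): Poisson counts averaged over P(w)."""
    dw = w_max / steps
    total = 0.0
    for i in range(steps + 1):
        w = i * dw
        weight = 0.5 if i in (0, steps) else 1.0
        poisson = (alpha * w) ** n * math.exp(-alpha * w) / math.factorial(n)
        total += weight * poisson * density(w)
    return total * dw

def bose_einstein_pmf(n, mean_count):
    """Bose-Einstein distribution with the given mean count."""
    return mean_count ** n / (1.0 + mean_count) ** (n + 1)

alpha, mean_w = 1.0, 2.0
for n in range(4):
    averaged = mandel_average(n, alpha, exponential_density(mean_w), w_max=60.0)
    print(n, round(averaged, 6), round(bose_einstein_pmf(n, alpha * mean_w), 6))
```

Replacing `exponential_density` with a narrow density around a fixed value of $w$ would bring the result back toward the Poisson form of (24).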


1614 


PROCEEDINGS OF THE IEEE. OCTOBER 1970 


4) Mode Density: So far as the question of density of
radiation modes is concerned, we can start from one of
several points of view. From the viewpoint of wave optics,
light of wavelength $\lambda$ emerging from a slit of width $\Delta x$ can
be expected to produce interference and diffraction effects
over an angle $\Delta\alpha$ such that $\Delta x\,\Delta\alpha \sim \lambda$. Extending this notion
to the elemental area $\Delta S = \Delta x\,\Delta y$ we see that

$\Delta\alpha\,\Delta\beta \sim \frac{\lambda^2}{\Delta S} \sim \frac{\Delta A}{R^2}.$ (26)

In terms of the “coherence area,” this can be written as

$\Delta A \sim \frac{\lambda^2 R^2}{\Delta S} = \frac{\lambda^2}{\Delta\Omega}.$ (27)


Further, if light proceeding from $\Delta S$ has a bandwidth $\Delta\nu$
then there exists a “coherence time” $\Delta t \sim 1/\Delta\nu$ corresponding
to a “coherence length” $\Delta l \sim c\,\Delta t \sim c/\Delta\nu$. Dividing by
two to take into account the two independent polarization
states, we now write for the “coherence volume”


II. Compound Photocurrent Distributions
It is clear in view of the preceding discussion that
when using a quantum detector, one always has a Poisson
process governing the current flow. That is, the number $N_t$ of
electrons flowing in any interval $(0, t)$ is a random variable. If
the time-space envelope $|V(t, \mathbf{r})|$ of the projected electromagnetic
(EM) field over $(0, t)$ is given, then the probability density
for $N_t = k$ electrons to flow in this interval is

$P_{N_t}(k) = \frac{\left[\int_0^t\!\int_A \beta\,|V(\tau, \mathbf{r})|^2\,d\tau\,d\mathbf{r}\right]^k}{k!}\,\exp\left[-\int_0^t\!\int_A \beta\,|V(\tau, \mathbf{r})|^2\,d\tau\,d\mathbf{r}\right].$ (34)

If on the other hand the quantity $|V(t, \mathbf{r})|$ is random or has a
random component, then (34) is a conditional density and
must be written as $P_{N_t}(k \mid |V(t, \mathbf{r})|)$. To find $P_{N_t}(k)$ requires the
additional averaging


$\Delta V = \frac{\Delta A\,\Delta l}{2} = \frac{(c\,\Delta t)\,\lambda^2}{2\,\Delta\Omega} = \frac{1}{2\,\Delta\Omega\,(\nu^2/c^3)\,\Delta\nu}$ (28)


$P_{N_t}(k) = E_{|V|}\left[P_{N_t}(k \mid |V(t, \mathbf{r})|)\right].$ (35)

For the purpose of this discussion we will assume that the
integration over the detector surface merely yields a constant
(e.g., a point detector) and that we can write

$\int_0^t\!\int_A \beta\,|V(\tau, \mathbf{r})|^2\,d\tau\,d\mathbf{r} = \alpha \int_0^t |a(\tau)|^2\,d\tau$

In a volume $V$, we expect to find $\Delta N = V/\Delta V$ modes, or in
terms of mode density

$N_\nu = \frac{\Delta N}{V\,\Delta\nu} = (2)(\Delta\Omega)\,\frac{\nu^2}{c^3}.$ (29)

For isotropic radiation, this reduces to the familiar expression

$N_\nu = \frac{8\pi\nu^2}{c^3}.$ (30)


From a purely quantum statistical point of view, the ele- 
mentary cell size in phase space is given by 


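$\Delta x\,\Delta p_x\,\Delta y\,\Delta p_y\,\Delta z\,\Delta p_z \sim h^3$ (31)

so that for a beam of photons of momentum $p = h\nu/c$ in a
solid angle $\Delta\Omega$ about $\mathbf{p}$ we have:

$\Delta x\,\Delta y\,\Delta z \sim \frac{h^3}{p^2\,\Delta p\,\Delta\Omega} = \frac{h^3}{(h^3\nu^2/c^3)\,\Delta\nu\,\Delta\Omega}$ (32)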


with $\alpha = \eta/h\nu$, $\eta$ the quantum efficiency, and $|a(t)|^2$ the
instantaneous power in the received process. Notice that
$|a(t)|$ is the envelope of the received process and that (35)
really amounts to performing the final average over the
statistics of the envelope.

In most communication problems (and the ones which
we will consider), the function $a(t)$ can be expressed as the
linear sum of a known signal $s(t)$ and a noise process $n(t)$.
The signal may also contain a stochastic parameter $\sigma$ to
represent a channel disturbance such as fading. As is common
at lower frequencies, the component $n(t)$ can be
accurately modelled as a Gaussian noise process. Hence
we will assume that $a(t)$ can be written as

$a(t, \sigma) = s(t, \sigma) + n(t)$


Dividing by two to account for the two orthogonal polariza- 
tion states, we end up with, again 


$\Delta V = \frac{1}{2\,\Delta\Omega\,(\nu^2/c^3)\,\Delta\nu}$ (33)
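The mode-counting results (28) through (33) are easy to evaluate numerically. The sketch below is a minimal illustration; the wavelength, field of view, and filter bandwidth are hypothetical values chosen for the example, and the function names are not from the paper.

```python
import math

C = 2.99792458e8  # speed of light in m/s

def mode_density(nu, solid_angle=4.0 * math.pi):
    """Modes per unit volume per unit bandwidth, both polarizations, as in (29)."""
    return 2.0 * solid_angle * nu ** 2 / C ** 3

def coherence_volume(nu, solid_angle, bandwidth):
    """Volume occupied by a single mode, as in (28) and (33)."""
    return 1.0 / (2.0 * solid_angle * (nu ** 2 / C ** 3) * bandwidth)

# Hypothetical receiver: 0.5 micrometre light, 1e-6 sr field of view, 100 GHz filter.
nu = C / 0.5e-6
field_of_view = 1e-6
bandwidth = 100e9
modes_per_m3 = mode_density(nu, field_of_view) * bandwidth
print(f"modes per cubic metre: {modes_per_m3:.3e}")
print(f"coherence volume:      {coherence_volume(nu, field_of_view, bandwidth):.3e} m^3")
```

With the default solid angle of $4\pi$, `mode_density` reduces to the isotropic expression $8\pi\nu^2/c^3$.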


which is the complex envelope of a deterministic signal plus
a narrow-band Gaussian noise process $x(t)$ centered at some
high frequency $f_0$:

$x(t, \sigma) = \mathrm{Re}\,[a(t, \sigma) \exp(i 2\pi f_0 t)].$


It is important therefore to know how many spatial and 
temporal modes of the radiation field interact with the 
photo-detector. We shall see that a single mode of chaotic
thermal radiation and stabilized laser radiation lead,
respectively, to Bose-Einstein and Poisson photocount
distributions. For the case of several radiation modes, one
needs to calculate the probability distribution for the sum 
of several random variables leading to multiple convolu- 
tions. 


It is also meaningful to expand $a(t, \sigma)$ in a complete
orthonormal Karhunen-Loeve series [11]

$a(t, \sigma) = \sum_{i=0}^{\infty} a_i(\sigma)\,\phi_i(t) = \sum_{i=0}^{\infty} \left(s_i(\sigma) + n_i\right)\phi_i(t)$

having the following properties:


KARP ei al COMMUNICATION THEORY FOR FREE-SPACE CHANNEL 


1615 


1) The $\{\phi_i(t)\}$ are solutions to the integral equation

$\lambda_i\,\phi_i(u) = \int_0^t K_n(u, v)\,\phi_i(v)\,dv$

where

$K_n(u, v) = E[n(u)\,n^*(v)]$

is the covariance function of the noise and is a real function.

2) $a_i(\sigma) = \int_0^t a(\tau, \sigma)\,\phi_i^*(\tau)\,d\tau = (a, \phi_i) = (s, \phi_i) + (n, \phi_i).$

3) The equality is in the sense of “limit-in-the-mean.”

4) to. 

5) The $a_i(\sigma)$ are independent Gaussian random variables,
when conditioned on $\sigma$.

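The generating function of this process $N_t$ can then be
written as [12]

$M_{N_t}(s) = E_a\left\{\exp\left[\alpha \int_0^t |a(\tau, \sigma)|^2\,d\tau\,(e^s - 1)\right]\right\} = E_a\left\{\exp\left[\alpha \sum_{i=0}^{\infty} |a_i(\sigma)|^2\,(e^s - 1)\right]\right\}$

which, using property 5) and [13], reduces to

$M_{N_t}(s) = \prod_{i=0}^{\infty} E\left[\exp\left(\alpha |a_i(\sigma)|^2 [e^s - 1]\right)\right] = \prod_{i=0}^{\infty} \frac{\exp\left\{\alpha |s_i(\sigma)|^2 (e^s - 1) / [1 - \alpha\lambda_i (e^s - 1)]\right\}}{1 - \alpha\lambda_i (e^s - 1)}.$ (36)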


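At this point, the variable $\sigma$ will be suppressed, although it
must be considered as a conditioning variable when encountered
in practice.

Notice that $M_{N_t}(s)$ is a product of similarly distributed
functions. The inverse transform of the $i$th component is

$P_{N_i}(k_i) = \frac{(\alpha\lambda_i)^{k_i}}{(1 + \alpha\lambda_i)^{1 + k_i}}\,\exp\left[\frac{-\alpha |s_i|^2}{1 + \alpha\lambda_i}\right] L_{k_i}\!\left[\frac{-\alpha |s_i|^2}{\alpha\lambda_i (1 + \alpha\lambda_i)}\right]$ (37)

where $L_k(y)$ is the Laguerre polynomial.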

1) No Additive Noise: In the limit as $\lambda_i \to 0$, (37) approaches

$\lim_{\lambda_i \to 0} P_{N_i}(k_i) = \frac{(\alpha |s_i|^2)^{k_i}}{k_i!}\,\exp(-\alpha |s_i|^2)$

and

$P_{N_t}(k) = \frac{\left[\alpha \sum_{i=0}^{\infty} |s_i|^2\right]^k}{k!}\,\exp\left(-\alpha \sum_{i=0}^{\infty} |s_i|^2\right) = \frac{(\alpha E_s)^k}{k!}\,\exp(-\alpha E_s)$ (38)

where $k = \sum_{i=0}^{\infty} k_i$ is the total count and $E_s = \sum_{i=0}^{\infty} |s_i|^2$ is the
total signal energy in the $(0, t)$ interval. Thus the deterministic
signal alone yields a Poisson distributed count.
This, of course, could have been deduced immediately
from (34). Notice, however, that when $|s_i|^2 = 0$

$P_{N_i}(k_i) = \frac{(\alpha\lambda_i)^{k_i}}{(1 + \alpha\lambda_i)^{1 + k_i}}$ (39)

and each of the coordinate components is Bose-Einstein
distributed [4].

In summary we see that : the signal alone can be con- 
sidered to be Poisson distributed along each of its co- 
ordinate axes in Hilbert Space; Gaussian noise alone is 
Bose-Einstein distributed along a particular set of coordi- 
nate axes in Hilbert Space; when signal is added to the 
noise the resultant process is distributed according to (37) 
along each of the coordinate axes determined by the noise 
alone. 

2) Band-Limited White Gaussian Noise: An important
case occurs in communication theory when the signal and
noise are passed through a filter before detection. We will
consider the case where the process $n(t)$ is band limited by
a rectangular filter with bandwidth $2B$. We will also assume
that the noise was initially white, with spectral density $N_0$.

It has been shown [14] that when a process is band
limited and then observed over a time interval $(0, t)$ the
eigenfunctions are prolate spheroidal wavefunctions. It has
also been shown that the first $(2Bt + 1)$ of these functions
accurately approximate the original function. This appears
valid for values of $2Bt$ as low as 3 and 5 [11]. Therefore, it is
a good engineering approximation to assume that the
eigenvalues associated with the first $(2Bt + 1)$ coordinates
are each $N_0$ with the remaining ones being zero. The generating
function $M_{N_t}(s)$ in (36) then becomes


$M_{N_t}(s) = \frac{\exp\left[\dfrac{\alpha (s, s)(e^s - 1)}{1 - \alpha N_0 (e^s - 1)}\right]}{[1 - \alpha N_0 (e^s - 1)]^{2Bt + 1}}$ (40)

with the corresponding probability density being

$P_{N_t}(k) = \frac{(\alpha N_0)^k}{(1 + \alpha N_0)^{k + 2Bt + 1}}\,\exp\left[\frac{-\alpha (s, s)}{1 + \alpha N_0}\right] L_k^{2Bt}\!\left[\frac{-\alpha (s, s)}{\alpha N_0 (1 + \alpha N_0)}\right]$ (41)

where $L_k^{2Bt}(x)$ is the Laguerre polynomial. We will now
consider some limiting forms of (41).
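The count density (41) can be evaluated directly with the standard three-term recurrence for generalized Laguerre polynomials. The sketch below is one way to do so, under the assumption that `signal_count` stands for $\alpha(s, s)$, `noise_per_mode` for $\alpha N_0$, and `modes` for $2Bt + 1$; these names and the numerical values are illustrative only.

```python
import math

def laguerre(k, order, x):
    """Generalized Laguerre polynomial L_k^(order)(x) by the three-term recurrence."""
    prev, curr = 0.0, 1.0
    for j in range(k):
        prev, curr = curr, ((2 * j + 1 + order - x) * curr - (j + order) * prev) / (j + 1)
    return curr

def laguerre_count_pmf(k, signal_count, noise_per_mode, modes):
    """Count probability of the form (41) for signal plus band-limited Gaussian noise."""
    n = noise_per_mode
    scale = n ** k / (1.0 + n) ** (k + modes)
    argument = -signal_count / (n * (1.0 + n))
    return scale * math.exp(-signal_count / (1.0 + n)) * laguerre(k, modes - 1, argument)

# Hypothetical operating point: 3 signal counts, 0.5 noise counts per mode, 3 modes.
pmf = [laguerre_count_pmf(k, 3.0, 0.5, 3) for k in range(150)]
total = sum(pmf)
mean = sum(k * p for k, p in enumerate(pmf))
print(f"total probability = {total:.6f}, mean count = {mean:.6f}")
```

The mean comes out as the signal count plus the noise count summed over all modes, and setting `signal_count` to zero recovers the negative binomial form discussed next.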

3) No Signal: In the absence of signal, (41) reduces to

$P_{N_t}(k) = \binom{k + 2Bt}{k} \frac{(\alpha N_0)^k}{(1 + \alpha N_0)^{k + 2Bt + 1}}$

which is a negative binomial distribution. There are two
important limiting cases for this distribution.

a) Limit $2Bt \to 0$

$P_{N_t}(k) = \frac{(\alpha N_0)^k}{(1 + \alpha N_0)^{k + 1}}$

For $2Bt \ll 1$, there is only one significant eigenvalue, the
average value. Since this occurs when $t \ll 1/2B$, it can
clearly be related to the approximation




$\int_0^t |a(\tau)|^2\,d\tau \cong |a(0)|^2\,t = \lambda_0 t$ (42)

using the mean value theorem for integrals. This latter
approximation is commonly used to obtain this result but
gives no insight into its meaning or its range of validity.


b) Limit $2Bt$ large, $\alpha N_0 \ll 1$

$P_{N_t}(k) = \frac{[\alpha 2Bt N_0]^k}{k!}\,\exp(-\alpha 2Bt N_0).$

Notice that since $2N_0 B$ is the noise power, $2Bt\,\alpha N_0$ is the
total noise energy in the $(0, t)$ interval. If we write this as $It$,
we have:

$\int_0^t \alpha |n(\tau)|^2\,d\tau = It$

where $I$ is in fact the time-averaged noise power


Thus for large $2Bt$, there is a smoothing of the fluctuations
in the noise process, and Poisson statistics prevail. The
condition $\alpha N_0 \ll 1$ is a little difficult to interpret, except
that it implies there be much less than one noise count
per degree of freedom, which is easily obtained in practice.
If one recognizes that a narrow optical filter has a bandwidth
on the order of 1 Å at visible wavelengths, or about
100 GHz, it is clear that large $2Bt$ is the most common
form of operation. $2Bt$ will be comparable in magnitude to
the ratio of the optical filter and system bandwidths.
Further, since almost all noise has a thermal origin,

$\alpha N_0 = \frac{\eta}{\exp(h\nu/kT) - 1} \ll 1$

is satisfied at optical frequencies. Actually, this is true assuming
one mode of operation. However, for the purpose of
this discussion we have considered a plane wave, or one
spatial mode.
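The two no-signal limits can be checked numerically. The sketch below writes the negative binomial density with `modes` standing for $2Bt + 1$ and `noise_per_mode` for $\alpha N_0$ (both names are assumptions for the example), then compares it with the Bose-Einstein form for a single mode and with the Poisson form for many weakly occupied modes.

```python
import math

def negative_binomial_pmf(k, noise_per_mode, modes):
    """No-signal count density: negative binomial with `modes` degrees of freedom."""
    n = noise_per_mode
    log_coeff = math.lgamma(k + modes) - math.lgamma(k + 1) - math.lgamma(modes)
    return math.exp(log_coeff + k * math.log(n) - (k + modes) * math.log(1.0 + n))

def bose_einstein_pmf(k, n):
    """Limit a): a single mode with mean count n."""
    return n ** k / (1.0 + n) ** (k + 1)

def poisson_pmf(k, mean):
    """Limit b): many modes, each with much less than one noise count."""
    return mean ** k * math.exp(-mean) / math.factorial(k)

# One mode: the negative binomial coincides with Bose-Einstein.
single_mode_gap = max(abs(negative_binomial_pmf(k, 0.7, 1) - bose_einstein_pmf(k, 0.7))
                      for k in range(6))

# Hypothetical many-mode case: 1000 modes with 0.002 counts each (mean count 2).
many_mode_gap = max(abs(negative_binomial_pmf(k, 0.002, 1000) - poisson_pmf(k, 2.0))
                    for k in range(11))
print(f"single-mode gap = {single_mode_gap:.2e}, many-mode gap = {many_mode_gap:.2e}")
```

Using `math.lgamma` rather than a binomial coefficient keeps the function usable when $2Bt + 1$ is not an integer.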

4) Signal Plus Noise: For this case, there are also two
limiting conditions for (41).

a) Limit $2Bt \to 0$

$P_{N_t}(k) = \frac{(\alpha N_0)^k}{(1 + \alpha N_0)^{1 + k}}\,\exp\left[\frac{-\alpha (s, s)}{1 + \alpha N_0}\right] L_k\!\left[\frac{-\alpha (s, s)}{\alpha N_0 (1 + \alpha N_0)}\right]$

As in the case for no signal, the probability density reduces
to that of an individual coordinate (37). Again this can be
interpreted as the zero order eigenvalue or average value,
as in (42).


b) Limit $2Bt$ large, $\alpha N_0 \ll 1$

$P_{N_t}(k) = \frac{[\alpha \{2BN_0 t + (s, s)\}]^k}{k!}\,\exp[-\alpha \{2BN_0 t + (s, s)\}].$

As might be expected from condition 3b) and (38), the
limiting condition for large $2Bt$ and $\alpha N_0 \ll 1$ corresponds
to a Poisson-distributed signal plus independent Poisson-distributed
noise. Since this is the most common condition
that one encounters in practice, considerable effort has gone
into exploring this approximation [17]-[20].

5) An Equivalent Eigenspace: Let us reexamine (37) and
(41). Equation (41) is obtained as a $(2Bt + 1)$-fold convolution
of probability densities in (37), where all the $\lambda_i$ are equal
to $N_0$. This can be written as

$P_{N_t}(k) = \bigotimes_{i=0}^{2Bt} \frac{(\alpha N_0)^{k_i}}{(1 + \alpha N_0)^{1 + k_i}}\,\exp\left[\frac{-\alpha |s_i|^2}{1 + \alpha N_0}\right] L_{k_i}\!\left[\frac{-\alpha |s_i|^2}{\alpha N_0 (1 + \alpha N_0)}\right]$ (43)

where $\bigotimes_{i=0}^{2Bt}$ denotes a $(2Bt + 1)$-fold convolution. Notice
that the only way in which the signal enters is through the
energy $(s, s)$. Now

$(s, s) = \int_0^t |s(\tau)|^2\,d\tau$

and since the signal is band limited to $\pm B$, we can partition
the $(0, t)$ interval into $(2Bt + 1)$ equal $\Delta T$ intervals where
$(2Bt + 1)\,\Delta T = t$. We can then closely approximate $(s, s)$ as

$(s, s) \cong \sum_{j=0}^{2Bt} |s(j\,\Delta T)|^2\,\Delta T, \quad j = 0, 1, 2, \ldots, 2Bt.$


We can also write $k$ as $k = \sum_{j=0}^{2Bt} k_j$ where $k_j$ is the contribution
of the $j$th interval to the total count $k$. Equation (41)
can then be decomposed into a $(2Bt + 1)$-fold convolution
of the form

$P_{N_t}(k) = \bigotimes_{j=0}^{2Bt} \frac{(\alpha N_0)^{k_j}}{(1 + \alpha N_0)^{k_j + 1}}\,\exp\left[\frac{-\alpha |s_j|^2\,\Delta T}{1 + \alpha N_0}\right] L_{k_j}\!\left[\frac{-\alpha |s_j|^2\,\Delta T}{\alpha N_0 (1 + \alpha N_0)}\right]$ (44)

Notice that (44) is equivalent to (43) and would be identical
if $|s_i|^2 = |s_j|^2\,\Delta T$ for all $i = j$. On the other hand, (44) is meaningful
as representing a processable signal formed from independent
samples as opposed to an abstract eigenspace.

For the particular case where the noise process is wide
sense stationary and $2Bt$ is large (see, for example, [11]),
one can approximate the eigenfunctions by harmonically
related cissoids, and $|s_i|^2$ and $N_0$ represent the Fourier coefficients
of the power density spectrum. Equations (43) and
(44) then express the duality of signal processing and design
in both time and frequency.

We can elaborate on this duality using the time-frequency
representation first considered by Gabor [21] (see Fig. 1).
The received process $a(t)$ considered exists over the interval
$(0, t)$, with frequency components primarily contained
in the interval $(-B, +B)$. This is a Hilbert space of
$(2Bt + 1)$ dimensions which can be considered either as
intervals of bandwidth $1/t$ in frequency or duration $1/2B$
in time. Hence we can observe the count $k_j$ by looking in
the time interval $(j/2B, (j + 1)/2B)$ with a filter of bandwidth
$2B$ or we can observe the count $k_i$ by looking in the frequency





Fig. 1. Time-frequency representation in terms of $i$-$j$ intervals.


band $(-B + i/t, -B + (i + 1)/t)$ for a time $t$. The first measurement
is a sum of all the squares in the $j$th column, while
the latter is a sum of all the squares in the $i$th row.

If the process is not wide sense stationary, we can still
use Parseval’s Theorem to write $(s, s)$ as

$(s, s) = \int_0^{\infty} P(f)\,df \cong \sum_{i=0}^{2Bt} P(i\,\Delta f)\,\Delta f$

and write a density similar to (44). $K_i$ would be the total
count in the band $\Delta f$ in the interval $(0, t)$. However, one
cannot assign the rigorous definition of power density spectrum
to the noise and the noise coefficients.

We note, finally, that the most common statistical behavior
encountered in practice yields $2B\,\Delta T \gg 1$. Hence
condition 4b) applies to any measurement interval of
length $\Delta T$. Thus the observance of counts over many independent
$\Delta T$ intervals is a sum of independent Poisson variables.
This interpretation was first proposed by Reiffen and
Sherman [17] on a heuristic basis, but can clearly be
shown to have a solid foundation.

III. Shot Noise Processes 

We have shown that a linear relation exists between the
average power $I$ of the radiation (over some finite aperture)
and the rate of flow of photons $n$. Thus if $n$ is a function of
time, we can write

$I(t) = h\nu\,n(t)$ (45)

where $h$ is Planck's constant and $\nu$ is the photon frequency.
Thus the detector of optical radiation can be represented
either as an instantaneous power detector or as an instantaneous
rate detector. This relationship is generally explained
by postulating that each incident particle independently
releases an electron with probability $\eta$ upon
arrival at the photodetector surface, the electron in turn
traveling to a cathode surface yielding a current “impulse”
effect at the detector output. Thus the total output current
$i(t)$ is due to the motion of a collection of electrons, proportional
in number to the arriving particles. We can, therefore,
write for the output current flow $i(t)$

$i(t) = \sum_{m=1}^{N_t} h(t - t_m)$ (46)

where $h(t)$ is the current impulse effect, $t_m$ is the time of
release of the $m$th photoelectron, and $N_t$ is the number of
such electrons occurring in the total time interval $(-\infty, t)$.
The function $h(t)$ has area equal to the charge of an electron,
while $N_t$ is the counting statistic, discussed in Section
II, of the photoelectron emissions. Note that if we neglect
space-charge effects in the photodetector, the travel time
of each released photoelectron is finite, which means that
the function $h(t)$ must be time limited to some interval $\tau$.
That is, $h(t) = 0$ for $t < 0$ and $t > \tau$. In this case, $N_t$ becomes
the counting statistic over the finite interval $(t - \tau, t)$. Since $\tau$
is inversely related to the detector bandwidth, $\tau$ is relatively
short ($10^{-10}$ to $10^{-7}$ s), and $h(t)$ can be considered a “delta
function” with respect to most modulation waveforms. It
perhaps should be pointed out that if $h(t)$ is assumed to be a
flat rectangular function over $(0, \tau)$, then $i(t) = N_t - N_{t-\tau}$ and
the detector output is precisely the counting process of the
received optical radiation. If, instead, a nonrectangular impulse
waveshape is to be accounted for, then one is forced
into a closer examination of the processes described by
(46). This class of processes can loosely be defined as “shot
noise” processes (although the exact definition of the latter
tends to vary in the literature).
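The superposition in (46) and the remark about rectangular impulse functions can be illustrated with a few lines of code. In this sketch the arrival times, the pulse width, and the normalization of the pulse to unit area (standing in for the electron charge) are all assumptions made for the example.

```python
def rectangular_pulse(width, charge=1.0):
    """Flat impulse response of the given width whose area equals `charge`."""
    def h(u):
        return charge / width if 0.0 <= u < width else 0.0
    return h

def shot_noise_current(t, release_times, pulse):
    """Output current of the form (46): one pulse per photoelectron release."""
    return sum(pulse(t - t_m) for t_m in release_times)

# Hypothetical photoelectron release times (seconds) and a 0.2 s pulse width.
release_times = [0.10, 0.25, 0.30, 0.90]
width = 0.2
current = shot_noise_current(0.35, release_times, rectangular_pulse(width))

# With a rectangular pulse, current * width / charge is the count in (t - width, t].
print(current * width)
```

Swapping in a non-rectangular `pulse` leaves `shot_noise_current` unchanged but breaks the simple proportionality to the windowed count, which is the situation the text describes as requiring a closer look at (46).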




As discussed in Section II, the parameter $N_t$ is a random
variable depending upon the intensity of the received field.
Recall that if $n(t)$ is a deterministic function, $N_t$ is a Poisson
random variable, with mean value given by the integral of
$\eta n(t)$ over $(t - \tau, t)$, and is a conditional Poisson random
variable if the intensity $n(t)$ is a sample function of a continuous
stochastic process. That is, given any intensity
function of the ensemble, the counting process $N_t$ is Poisson.
With Poisson counting processes the resulting shot noise
processes are referred to as Poisson shot noise (PSN). Some
excellent discussions of PSN processes are given by Rice
[11], Middleton [22], Papoulis [24], and Parzen [23]. In
essence, first- and second-order statistics, such as probability
densities, moments, power spectra, and correlation
functions have been well developed. For the conditional
PSN, the foregoing statistical characteristics can be formally
attained by taking subsequent averages over the PSN results.
For example, consider the power spectrum of the conditional
PSN process in (46), where the intensity $n(t)$ is a
sample function of an ensemble of positive random stationary
process $N$ defined over $(-\infty, \infty)$. We formally define
the time averaged power density spectrum [25] of the
shot noise process $i(t)$ by


$S_i(\omega) = \lim_{T \to \infty} \frac{1}{2T}\,E\left[|I_T(\omega)|^2\right]$ (47)

where $E$ is the expectation operator and $I_T(\omega)$ is the Fourier
transform of $i(t)$ over $(-T, T)$. For the PSN processes,
(47) can be readily determined as

$S_{PSN}(\omega) = \lim_{T \to \infty} \frac{1}{2T}\left[N_T + |\Phi_T(\omega)|^2\right] |H(\omega)|^2$ (48)

where



and $H(\omega)$ and $\Phi_T(\omega)$ are the Fourier transforms of $h(t)$ and
$n(t)$, $-T < t < T$, respectively. The subsequent statistical
average over $N$, and time average over $T$ via the limiting
operation, yield the power density spectrum for conditional
PSN processes

$S_i(\omega) = |H(\omega)|^2 \left[E(N) + S_N(\omega)\right]$ (50)

where $S_N(\omega)$ is now the time averaged power density spectrum
of the stochastic intensity $n(t)$. The foregoing result
is significant, since it is valid for any counting statistic
generated from conditional Poisson statistics and, therefore,
includes those discussed in Section II. Note that the
spectrum always takes the form of the intensity spectrum
immersed in a background of “noise” of spectral shape
$E(N)|H(\omega)|^2$. (For infinite bandwidth detectors, $H(\omega) \approx 1$ for
all $\omega$, and the aforementioned represents basically “white”
noise.) This noise constitutes the shot noise of the detector,
and is due to the discreteness of the photoelectron model.
The intensity spectrum $S_N(\omega)$, in general, contains portions
due to desired intensity modulation, portions due to background
effects, and associated cross-spectral terms. These
latter two components constitute the “fluctuation” noise of
the photodetector output. Since the spectrum in (50) has the
form of a “signal plus noise,” there is a tendency to view the
photodetected output as signal plus additive noise. The difficulty,
of course, is that the signal and noise are not independent,
and usual “signal plus noise” interpretations,
familiar to communication engineers, often lead to false
conclusions (e.g., see Section IV).

It is often instructive to examine the “instantaneous” or
“short-term averaged” power spectrum of the detector output,
which can be viewed basically as the conditional
spectrum of (48) before the time averaging limit is taken. If
we interpret the $2T$ interval to be the interval $(t - \tau, t)$,
instead of $(-T, T)$, we see that the bracketed term in (48)
will contain terms dependent on $t$. Furthermore, if we include
the fact that the electron functions $h(t)$ have time
widths $\tau$ much shorter than the time variations in $n(t)$, then
the intensity $n(t)$ is approximately constant over $(t - \tau, t)$.
Its power spectrum is then a delta function and the bracketed
terms in (48) take the form

$n(t) + n^2(t) \left[\frac{\sin(\omega\tau/2)}{\omega\tau/2}\right]^2$


That is, the instantaneous spectrum (power spectrum before
the time average is taken) has the appearance of a background
shot noise whose level varies with time, and whose
average value varies according to the instantaneous value of
$n(t)$. In this sense, the detector acts as an instantaneous
“power” detector, which is the accepted classical definition
of photodetectors. The true frequency content of the shot
noise is not exhibited, however, until the time averaging is
invoked.

The foregoing discussion raises an interesting query that
cannot be answered from a spectral density point of view.
If the shot noise process is to represent a true intensity
detector, even when $n(t)$ is a stochastic process, then the
statistical properties of the shot noise in (46) must be related
to those of the intensity process $n(t)$. When the intensity is a
deterministic time function, the relations between the shot
noise and its intensity are well known. However, when the
intensity is itself stochastic, the manner in which the
statistics of the intensity and the conditional PSN are related
is somewhat vague. For example, although the first-order
probability density of $i(t)$ is difficult to write in closed
form, its characteristic function is immediately available by
making use of the known characteristic function of PSN
[12], [23], [24]. Thus


$\phi_{i_t}(\omega) = E_N\left\{\exp\left[\sum_{r=1}^{\infty} \frac{(j\omega)^r}{r!} \int_{t-\tau}^{t} n(\rho)\,h^r(t - \rho)\,d\rho\right]\right\}$ (51)

where $E_N$ is the average over the process $N$. One way to
interpret (51) is to assume infinite bandwidth detectors, and
factor the first term of the exponential summation. Thus

$\phi_{i_t}(\omega) = E_N\{\exp(j\omega n(t))\,G[\omega, n(t)]\}$ (52)

where the $G$ function represents the remaining factors. The
average of the first term alone is precisely the characteristic



function of the intensity process $N$ at any time $t$. Thus the
effect of the function $G$ is to cause a departure of the first-order
probability density of $i(t)$ from that of $n(t)$. The conditions
under which the latter effect is negligible, and the shot
noise process has approximately the first order density of $N$,
have been studied by Karp and Gagliardi [26]. In this latter
instance, we can say that the shot noise represents (statistically)
the intensity process. This representation can be related
to the “denseness” of the photon arrivals; i.e., the
average number of photons per second. In fact, when the
latter parameter is large, it can be shown that the bracketed
term in (52) is approximately the characteristic function of a
Gaussian random variable, with mean $n(t)$ and variance
$n(t)$. This implies that the conditional (on $N$) probability
density of $i(t)$ at any $t$ approaches asymptotically a Gaussian
density, which again may be loosely interpreted as an instantaneous
“signal” $n(t)$, immersed in additive nonstationary
Gaussian noise of variance $n(t)$.

The relation between shot noise and its intensity can be
further investigated by consideration of the individual
moments of the two processes. The moments of the process
$i(t)$ can be obtained from its semi-invariants, which are, for
PSN processes

$\lambda_r(t) = \int h^r(t - \rho)\,n(\rho)\,d\rho.$ (53)

The moments can then be obtained by the sequence of relations
$E(i) = \lambda_1$, $E(i^2) = \lambda_2 + \lambda_1^2$, $E(i^3) = \lambda_3 + 3\lambda_1\lambda_2 + \lambda_1^3$, etc. For
conditional PSN processes, the $\lambda_r$ are themselves random
processes, and the moments of $i(t)$ depend upon the higher
order moments of the process $n(t)$. However, if the intensities
are continuous, or if the detector bandwidth is much larger
than the bandwidth of the intensities, the $r$th moments are
related by

$E(i^r) = E(N^r) + D(r)$ (54)

where $D(1) = 0$ and $D(r)$, $r \ge 2$, is a function depending upon
the higher order statistics of $n(t)$ and upon the function
$h(t)$. This relationship was investigated in [26]. It was
shown, for example, that if the function $h(t)$ was rectangular
over $(0, \tau)$ the $r$th moment of $i(t)$ was approximately equal to
the $r$th moment of the intensity process $N$ if

(average number of photon arrivals in $\tau$ seconds) $\gg r(r - 1)$. (55)

Equation (55) essentially states that the denseness of the
shot events (i.e., the average number of $h(t)$ functions overlapping
the time interval of one function) must be sufficiently
large for moment representation. The right side of
(55) serves as a rough rule of thumb for determining how
large this denseness must be for approximate equality of the
$r$th moment. It may be recalled [20] that for PSN processes
(deterministic intensities) a condition of large number of
shot occurrences is required before the PSN loses its discrete
nature. Equation (55) can therefore be interpreted as the
statistical equivalent of this statement; i.e., the condition
under which the conditional PSN begins to take on the
statistics of its intensity.

By using (54), it is also possible to relate the fluctuations
in the detector output $i(t)$ to those of the intensity $n(t)$.
Specifically, if we define the signal-to-noise ratio (SNR) of a
positive process as the ratio of its mean value squared to the
variance, then (54) leads to the fact

SNR of $i(t) \le$ SNR of $n(t)$ (56)

which implies that the percent fluctuations in the shot noise
are always at least as great as those of the intensity itself.
We make this point mainly because the foregoing definition
of SNR is commonly used in assessing signal quality in
communication system analysis.
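The inequality (56) can be made concrete for the simplest case of a counting detector (rectangular $h(t)$, unit charge), where the output is a conditionally Poisson count. For such a count the variance is the mean of the integrated intensity plus its variance, a standard property of doubly stochastic Poisson variables. The sketch below applies it to a hypothetical two-level intensity; the levels, probabilities, and function names are assumptions made for the example.

```python
def snr(mean, variance):
    """SNR as defined in the text: mean squared divided by variance."""
    return mean * mean / variance

def count_and_intensity_snr(levels, probabilities):
    """SNR of a conditionally Poisson count and of its integrated intensity."""
    mean = sum(p * m for m, p in zip(levels, probabilities))
    variance = sum(p * (m - mean) ** 2 for m, p in zip(levels, probabilities))
    # Conditionally Poisson count: same mean, variance increased by the mean.
    return snr(mean, mean + variance), snr(mean, variance)

# Hypothetical intensity: 5 or 15 expected photoelectrons per interval, equally likely.
count_snr, intensity_snr = count_and_intensity_snr([5.0, 15.0], [0.5, 0.5])
print(f"SNR of count = {count_snr:.3f}, SNR of intensity = {intensity_snr:.3f}")
```

Scaling both levels up by a common factor leaves the intensity SNR fixed while the count SNR approaches it, in line with the denseness condition (55).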

It may be noted that the conditions for which the intensity
is represented by a shot noise process are also useful in
“building up” intensity models as shot noise. This type of
shot noise modeling has been used for studying radiation
scattering and perturbation effects [27], [28] in which the
impulse functions $h(t)$ were reinterpreted as wave packets.

With the statistics of the conditional shot noise process
identified (at least in first- and second-order statistics), the
problem of optimal processing procedures at the photodetector
output can now be properly formulated, and in
some instances, solved. For example, the problem of optimal
linear filtering of the process $i(t)$, so as to minimize
the mean squared error from the desired intensity, was considered
in [26]. For certain types of pulsed intensities, as in
PCM communications, optimal operations maximizing
output signal to noise ratios have also been considered [29].
The application of estimation theory [30], tracking operations
[31], and detection procedures [17], [18], [20] to the
photodetector shot noise output has been under study, and
appears to be a problem area of considerable interest from
both a practical and theoretical point of view.

IV. Digital Communications and 
Optical Systems 

The availability of an easily generated extremely narrow
pulse in the optical region of the spectrum suggests a natural
application to communication by digital methods. This
notion, in turn, has fostered an increasing interest in the
application of both classical detection theory and information
theory to optical systems. Since the output of a photodetector
is a sequence of electron counts, the detection problem is
formally one of decisioning in the presence of generalized
Poisson statistics. While early approaches to the problem
basically were confined to pure Poisson counting [32], [33],
more recent attention has included the generalized Laguerre
counting processes in Section II [13].

The formulation of the general $M$-ary detection problem
involving counting statistics proceeds as follows. The
transmitter sends a signal whose intensity is modulated
with one of a set of $M$ possible intensities, each $T$ seconds
long. The received signal is corrupted by background radiation,
which we assume here is white Gaussian noise of level
$N_0$ watts per hertz per unit area, and optical bandwidth $B$.
The output of the photodetector at the receiver is then a


» 


r(r - 1) 


(55) 




time varying process of electron counts, obeying a generalized
Poisson distribution, as in Section II. The receiver
observes the counting process over $(0, T)$ and decides which
of the $M$ possible intensities is being received. Since $K$
binary digits can be uniquely encoded into $2^K = M$ possible
intensity waveforms, a correct decision effectively decodes $K$
data bits. The foregoing model can be cast into a discrete
format by subdividing the interval $T$ into $\Delta T$-second
intervals ($\Delta T \approx 1$/information bandwidth) and associating
a signal energy component $s_{ji}$ for the $j$th intensity and $i$th
interval. (That is, $s_{ji}$ is the total energy associated with the
$2B\,\Delta T$ samples, or modes, of the Karhunen-Loeve expansion
of the $j$th intensity during the $i$th $\Delta T$ interval.) Under a
fixed energy constraint, we require $\sum_i s_{ji} = E$ for all $j$. The
discrete problem then is to detect which of the possible
intensity vectors $\mathbf{s}_q = \{s_{qi}\}$ is controlling the counting
process by observing the sequence of independent counts
$\mathbf{k} = \{k_i\}$, $i = 1, 2, \ldots, M$ $(= T/\Delta T)$. Under a maximum
likelihood detection criterion, and a priori equally likely
signals, the optimal test is to form the likelihood functionals
$\Lambda_q(\mathbf{k})$ and select $\mathbf{s}_q$ as being transmitted if no other $\Lambda_j(\mathbf{k})$
exceeds $\Lambda_q(\mathbf{k})$. If a likelihood draw occurs (more than one
$\Lambda_q(\mathbf{k})$ is maximum) any randomized choice among the
maxima can be used. From (37), the likelihood test is
therefore equivalent to comparing:




(57) 


for all ry, where s qi is now a normalized signal intensity obey- 
ing the constraint £<>,, = £ = A. In typical operation, 
$2B\,\Delta T \gg 1$ (i.e., the optical bandwidth is much greater than
the information bandwidth) and (57) is approximately


\(k) s fl 

i=i 


k, 


2BAT + 


N 0 (l + xNol 


Ik 


(58) 


After observing $\mathbf{k}$, examination of the set of $\{\Lambda_q\}$ for
maxima is equivalent to the comparison of the set of functions
$\sum_i k_i \log(1 + s_{qi}/K)$, where $K = 2BN_0\,\Delta T$ represents the
noise energy per counting interval per unit area. (Recall it
was previously shown in Section II that under the condition
$2B\,\Delta T \gg 1$ the counts $k_i$ are Poisson variates so that complete
statistics of the foregoing test can be determined.)
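The comparison of the functions $\sum_i k_i \log(1 + s_{qi}/K)$ translates directly into a small decision routine. The sketch below is a minimal illustration for equal-energy intensity sets; the count vector, the energy, the noise level, and the function names are hypothetical, and ties are resolved by taking the first maximum rather than by the randomized choice mentioned in the text.

```python
import math

def decision_statistic(counts, intensity, noise_energy):
    """Sum of k_i * log(1 + s_qi / K) for one candidate intensity vector."""
    return sum(k * math.log(1.0 + s / noise_energy) for k, s in zip(counts, intensity))

def detect(counts, intensity_set, noise_energy):
    """Index of the candidate intensity with the largest decision statistic."""
    stats = [decision_statistic(counts, s, noise_energy) for s in intensity_set]
    return max(range(len(stats)), key=stats.__getitem__)

def pulsed_set(m, energy):
    """Candidate set with all energy of the q-th intensity in the q-th interval."""
    return [[energy if i == q else 0.0 for i in range(m)] for q in range(m)]

# Hypothetical observation: counts in four intervals, unit noise energy per interval.
observed_counts = [1, 0, 7, 2]
decision = detect(observed_counts, pulsed_set(4, energy=5.0), noise_energy=1.0)
print("decided intensity index:", decision)
```

For a pulsed set the statistic for candidate $q$ is proportional to the count in interval $q$ alone, so the rule reduces to picking the interval with the largest count.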

An indication of the performance of a detection test is
given by the divergence, or “expected distance between
hypotheses.” The divergence is formally defined as

$D_{jq} = E_k(\Lambda_{jq} \mid j) - E_k(\Lambda_{jq} \mid q)$ (59)

where $\Lambda_{jq} = \Lambda_j(\mathbf{k}) - \Lambda_q(\mathbf{k})$ and $E_k(\Lambda \mid j)$ is the conditional
average of $\Lambda$ over $\mathbf{k}$ given the intensity $\mathbf{s}_j$. Abend [18] has
shown that for Poisson counting, using the functions of (58)
and $M = 2$ (binary detection), the divergence normalized
by the variance of $\Lambda$ is maximized by a “pulsed” type of
intensity, in which the available signal energy is wholly
concentrated in a single counting interval. That is, an intensity
set defined by

$s_{qi} = E\,\delta_{qi}$ (60)

where $\delta_{qi}$ is the Kronecker delta function. Kailath [19] extended
this result by showing that under a total energy constraint,
other suitable forms of distance are maximized by
similar pulsed intensities. Gagliardi and Karp [20] applied
an average divergence criterion to the general $M$-ary Poisson
detection problem and again showed the optimality of
the intensity set of (60). In the latter reference, the intensity
set that maximized the probability of correctly detecting the
true intensity, rather than maximizing divergence, was also
considered, and shown to correspond to the pulsed set in
two special cases: 1) $M = 2$ with symmetric intensity sets and
2) any $M$ and low intensity-to-noise-energy ratio. However,
the determination of global optimal intensity sets in the
pure Poisson case, based upon detection probability, still
remains a difficult task. It has been conjectured by many
that the pulsed set of (60) is, in fact, a global optimal set,
but to the authors’ knowledge a rigorous proof has not been
shown. The optimality of the pulsed set, even under this
special criterion, is significant, since it indicates the importance
of intensity waveshape in digital system design. This,
of course, is partly due to the general advantage of orthogonal
signals in detectability, a property afforded by the disjointness
of the pulsed set in (60). The use of signals placed
in adjacent time slots is in essence a pulse position modulated
system in which each position corresponds to a digital
word. The dual of such a system (a frequency keyed system),
which also retains the orthogonality property, can similarly
be generated by redefining the expansion functions of the
received field [34].

It should be pointed out that if the condition $2B\Delta T \gg 1$ is not valid, care must be used in accepting the pulsed set of (60) as an optimal intensity set. In particular, the Poisson assumption and the use of (58) are no longer valid. For the case of $2B\Delta T \ll 1$, the divergence in (59) must be obtained by averaging terms as in (57) over the Laguerre densities. If this averaging is carried out, (59) takes the form

where $I_0(x)$ is the imaginary Bessel function of zero order and $C$ contains terms common to all $q$ and $j$. Now it is no longer immediately evident that the pulsed set of (60) maximizes $D_{qj}$. The last term, however, is minimized if either $s_{qi} = 0$ or $s_{ji} = 0$ for all $i$, which suggests a disjoint intensity set, but it is not evident that the first terms are maximized under the same condition. The difficulties of this problem are quite reminiscent of similar difficulties in attempting to find optimal signal sets in noncoherent additive Gaussian noise channels.

When the pulsed set of (60) is used and the general Laguerre counting is assumed, the analysis procedures are similar to the Poisson case. It is easy to show the monotonicity of Laguerre functions with respect to their indices. It then follows from (57) that $\Lambda_q \ge \Lambda_j$ if $L_{k_q}(N/N_0) \ge L_{k_j}(N/N_0)$, which, in turn, is true if $k_q \ge k_j$. Hence the maximum likelihood test need only count over each interval, selecting the signal corresponding to the interval with the largest count.
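
The decision rule just described (count over each interval, select the largest, randomize on a draw) can be sketched in a few lines. This is only an illustration: the function name `ml_decide` and the use of Python's `random` module for the tie-break are assumptions introduced here, not part of the original analysis.

```python
import random


def ml_decide(counts, rng=random):
    """Pick the PPM slot with the largest photoelectron count.

    counts : sequence of integer counts k_i, one per slot.
    rng    : object with a .choice() method (hypothetical hook so that
             the tie-break can be made reproducible in tests).

    A "likelihood draw" (several slots sharing the maximum) is resolved
    by a uniform random choice among the tied slots.
    """
    top = max(counts)
    tied = [i for i, k in enumerate(counts) if k == top]
    return rng.choice(tied)


if __name__ == "__main__":
    print(ml_decide([0, 3, 1, 2]))   # slot 1 holds the unique maximum
    print(ml_decide([2, 2, 0]))      # slot 0 or 1, chosen at random
```

With equally likely signals, the uniform tie-break is what produces the factor 1/(r + 1) used in the error analysis that follows.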

A. Error Probabilities with Pulsed Intensity Sets and 
Poisson Counting 

The performance of the pulsed intensity set in $M$-ary detection can be evaluated by considering the error probability when Poisson counting statistics are assumed. This can be obtained by noting that for the pulsed intensity set of (60) the log of the likelihood functions for each $k_i$ constitutes a set of independent Poisson random variables. The variable for $k_q$ has intensity $(N + 2B\alpha N_0\Delta T)$ if the $q$th intensity was sent, and has intensity $K = 2B\alpha N_0\Delta T$ otherwise. Recall that if the $q$th intensity is sent, a correct decision will be made with probability $1/(r + 1)$ if the log likelihood equals $r$ others and exceeds the remaining $M - (r + 1)$. Therefore, upon considering all possibilities, the error probability can be derived as [20]


P e (E, K,M) = 1 


where : 



The function $P_E(N, K, M)$ has been plotted by Pratt [32] for $M = 2$, and recently a digital computation has been generated [23] for a complete plot of the function. An exemplary plot is shown in Fig. 2, in which $P_E(N, 3, M)$ is plotted for various $M$ as a function of $N$. It is important to note that $P_E$ depends on both the normalized signal energy $N$ and the normalized noise energy $K$ in the counting interval, and not simply on their ratio. This fact is emphasized in Fig. 3, in which $P_E(N, K, 2)$ is plotted as a function of $K$ for 2 fixed ratios $N/K$. This dependence on both signal and noise energies distinguishes the Poisson detection problem from the analogous coherent Gaussian channel problem. Note that the interfering noise energy $K$ depends only upon the background energy in the interval $\Delta T$, which is the width of the transmitted intensity pulse. The prime advantage of Poisson systems is precisely their ability to remove the effect of background noise by making $\Delta T$ small, as has been emphasized in previous reports [36], [37]. The actual dependence of $P_E$ on the parameter $\Delta T$ has been considered [38], and the improvement in error probability with decreasing $\Delta T$ has been demonstrated. The improvement, of course, is made at the expense of information bandwidth and peak power, both inversely proportional to $\Delta T$. Surprisingly, the improvement is quite small at low values of $N$, and the increase in bandwidth may not be worth the decrease obtained in error probability. The effect on error probability of additive extraneous thermal noise in the decisioning system and of the statistical characteristics of photomultipliers has also been considered [38].

[Fig. 2. Error probabilities for $M$-ary signaling. Horizontal axis: average number of signal counts, $N$.]

[Fig. 3. Error dependence on signal and noise energies. Horizontal axis: average noise count per interval, $K$.]
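
The tie-breaking argument used above to arrive at the error probability can be evaluated numerically straight from its verbal description: the signal slot has a Poisson count of mean N + K, the other M - 1 slots have mean K, and a correct decision occurs with probability 1/(r + 1) when the signal count ties r noise counts and exceeds the rest. The sketch below is not the closed form of [20]; the function names and the truncation rule `kmax` are assumptions made for illustration.

```python
import math


def poisson_pmf_table(mean, kmax):
    """Poisson probabilities P(k) for k = 0..kmax, built recursively."""
    p = [math.exp(-mean)]
    for k in range(1, kmax + 1):
        p.append(p[-1] * mean / k)
    return p


def ppm_error_probability(N, K, M, kmax=None):
    """Error probability for M-ary pulsed signaling with Poisson counts.

    N : average signal count, K : average noise count per slot,
    M : number of slots. kmax truncates the sum over the signal count
    (heuristic default; an assumption of this sketch).
    """
    if kmax is None:
        kmax = int(N + K + 12 * math.sqrt(N + K + 1) + 30)
    sig = poisson_pmf_table(N + K, kmax)
    noi = poisson_pmf_table(K, kmax)
    p_correct = 0.0
    below = 0.0  # P(noise count < k), updated as k increases
    for k in range(kmax + 1):
        inner = 0.0
        for r in range(M):  # r noise slots tie the signal slot at k
            inner += (math.comb(M - 1, r) * noi[k] ** r
                      * below ** (M - 1 - r) / (r + 1))
        p_correct += sig[k] * inner
        below += noi[k]
    return 1.0 - p_correct


if __name__ == "__main__":
    for N in (5.0, 10.0, 20.0):
        print(N, ppm_error_probability(N, 3.0, 2))
```

With K = 0 the only possible error is a tie at zero counts, so the sketch returns exp(-N)(1 - 1/M), which for M = 2 equals exp(-N)/2.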

For Laguerre counts, (62) must be rederived using the Laguerre densities discussed in Section III. Recently, general bounds on the error probability in this latter case, using the orthogonal (disjoint) signal intensity sets, have been reported [34].

B. Information Rate of a Poisson PPM System 

We have so far analyzed only one aspect of system performance, i.e., error probabilities. The actual information rate that the link achieves is another important design consideration. As stated, the transmitter sends optical energy in one of $M$ time intervals, each of which is $\Delta T$ seconds wide, thereby transmitting one of $M$ possible signals in $M\Delta T$ seconds, or at a rate $(\log_2 M)/M\Delta T$ bit/s. The receiver correctly determines the true signal with probability $1 - P_E$ and is in error with probability $P_E$. Because of symmetry, the erroneous signal may be equally likely interpreted as any of the $M - 1$ incorrect signals. Thus the overall channel may be depicted as an $M$-ary symmetric channel, in which each of the $M$ possible transmitter signals is converted to itself with probability $1 - P_E$ and converted to each other signal with probability $P_E/(M - 1)$. The information rate for such a channel is known to be

$$H = \frac{\log_2 M + P_E \log_2 [P_E/(M - 1)] + (1 - P_E)\log_2 (1 - P_E)}{M\Delta T}. \tag{63}$$

For convenience we shall denote this as

$$H = C(N, K, M)/M\Delta T \tag{64}$$

to emphasize the dependence of the numerator on the stated parameters. By using (63) and the families of error probability curves as in Fig. 2, the rate $H$ can be evaluated by straightforward substitution. Although specific curves for such a computation are not shown here, it suffices to note that if $N$ and $K$ are such that $P_E < 10^{-x}$ [exponent illegible in the source], then (63) is, to a good approximation,

$$H \approx (1 - P_E)[(\log_2 M)/M\Delta T] = (\log_2 M)/M\Delta T - P_E[(\log_2 M)/M\Delta T]. \tag{65}$$

If we interpret the rate $H$ as the source rate minus the equivocation of the channel, then the PPM optical system behaves approximately as if a source rate of $\log_2 M/M\Delta T$ is passed into a channel of equivocation $P_E \log_2 M/M\Delta T$. As noted in (62), even if $K \to 0$ (no background interference), $P_E \ge \exp(-N)/2$, so that the equivocation is not due entirely to the background noise.
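
The rate expression (63) for the M-ary symmetric channel and its small-error approximation (65) are easy to compare numerically. The following sketch takes the error probability as an input; the function names are hypothetical, and the terms with a zero probability are handled by the usual convention 0 log 0 = 0.

```python
import math


def ppm_rate(p_e, M, dT):
    """Information rate (63) in bit/s for an M-ary symmetric channel.

    p_e : word error probability, M : number of slots,
    dT  : slot width in seconds.
    """
    c = math.log2(M)
    if p_e > 0.0:
        c += p_e * math.log2(p_e / (M - 1))
    if p_e < 1.0:
        c += (1.0 - p_e) * math.log2(1.0 - p_e)
    return c / (M * dT)


def ppm_rate_approx(p_e, M, dT):
    """Small-error approximation (65): source rate minus P_E times it."""
    return (1.0 - p_e) * math.log2(M) / (M * dT)


if __name__ == "__main__":
    for p_e in (1e-2, 1e-3, 1e-4):
        exact = ppm_rate(p_e, 16, 1e-9)
        approx = ppm_rate_approx(p_e, 16, 1e-9)
        print(p_e, exact, approx)
```

The numerator returned before division by M dT is the quantity written C(N, K, M) in (64); the rate vanishes when all M outputs are equally likely, i.e., when P_E = (M - 1)/M.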

The use of (63) and the previous equations is helpful in determining the rate, given operating parameters. However, the converse design problem, which is to determine particular parameter values that achieve a desired rate, is not so straightforward, because the rate is a somewhat complicated function of the parameters. We shall consider here two aspects of this design problem that have practical application under certain operating conditions. First, the word period $T = M\Delta T$ is held fixed while the information bandwidth $1/\Delta T$ is allowed to vary, and second, the bandwidth is held fixed while the word period is allowed to vary. In both cases, we are interested in the relationship between the rate $H$ and the transmitter parameters $N$ and $M$, assuming that the noise power is held fixed.


C. Fixed Word Period 

We assume here that $\Delta T$ is allowed to vary with $M$ so as to maintain $T = M\Delta T$ constant. Thus the system "squeezes" more signals into the $T$-second period as $M$ increases. The resulting rate is then

$$H = C(N, K_T/M, M)/T \tag{66}$$

where $K_T$ is the noise energy in $T$. Thus the rate depends only upon the numerator of (63). With $N$ fixed, increasing $M$ increases the source rate, but the error probability also increases and eventually reaches an asymptotic value of

for large $M$. The resulting system rate increases, to within a constant of the entropy of the alphabet, $\log_2 M/T$. Therefore, it is clear that if the bandwidth is expendable, one will always increase the system rate for large $M$ by increasing $M$. In a practical system, this implies that one should operate with as wide a bandwidth as possible to fully exploit the capability of the PPM system. We are, therefore, led naturally to consider the design of a system for an arbitrary rate $H$, when the full bandwidth $(1/\Delta T)$ of the system is limited.

D. Fixed Bandwidth

In this case, $\Delta T$ is held constant (thereby fixing the noise energy $K$ in $\Delta T$) so that both the numerator and denominator in (63) depend upon $M$, and the rate degrades quickly as $M$ increases due to the $\log M/M$ dependence. A given rate, e.g., $H_0$, may be obtained by many different combinations of $N$ and $M$. These equivalent operating points may be obtained graphically by noting that they are the values for which the numerator $C(N, K, M)$, considered as a function of $M$, intersects the straight line $H_0\Delta T M$. By plotting these functions for various $N$, their intersections will identify $(N, M)$ pairs which achieve the rate $H_0$. One may then decide on a particular operating point by invoking suitable design criteria. For example, one may select the smallest $M$ from among the candidate pairs, which then minimizes the word period $T = M\Delta T$. Alternatively, one may choose to minimize the average transmitter power per information bit, which is proportional to $N/C$. In the latter case, therefore, one would select the operating pair $(N, M)$ for which $N/C$ is minimal. The latter parameter is recognized as the $\beta$-efficiency parameter (energy per data bit) of a communication system [39]. If the value of $N/C$, corresponding to the optimal $(N, M)$ pair, is tabulated, the results can be compared to previously derived performance based upon the same parameter. This type of comparison was considered [40] and it was shown that the PPM system outperformed an optical heterodyne system for sufficiently large $M$, approaching in fact the minimum $\beta$ generated by the Gordon bound for quantum systems. This type of result further emphasizes the importance of expending system bandwidth (increasing $M$ also implies increasing information bandwidth) to improve overall performance. The effect of Laguerre statistics (when the information bandwidth approaches the optical bandwidth) and the effect of additive noise can be accounted for by modifying these Poisson results [40].

The extension of the discrete model for optical detection 
(which was assumed almost entirely in the aforementioned 
references) to the continuous model has received little 
attention. In usual procedures, the continuous case is gen- 
erated from the discrete by taking limits of infinitely small 
intervals. Although this procedure can be properly struc- 
tured to generate the continuous version of the counting 
process, the continuous process representing the photode- 
tector output must be viewed entirely as a shot noise process 
(see Section III). Unfortunately, such processes have first- 
order densities that are expressible only through transforms 
of their characteristic function. Hence the building up of a 
general detection model based upon shot noise rather than 
discrete processes would be severely hampered by the inabil- 
ity to express observable statistics. It would appear, how- 
ever, that shot noise detectability cannot continue to be 
avoided when consideration is given to operation with in- 
formation bandwidths on the order of optical detector 
bandwidths. This aspect of detection deserves more atten- 
tion in future research studies. 

V. Analog Communications 

The major portion of work in the area of analog commu- 
nications for optical systems has centered on first- and 
second-moment theory, spectral analysis, and signal-to- 
noise ratios. We have already discussed spectral analysis for 
shot noise processes with emphasis on signal representation. 
For the remainder of the paper we will concentrate on trying
to bring together some of these ideas in a unified way, leaning
heavily on physical motivation.

Before turning to the analyses required, it is very instructive to reconsider the behavior of a photodetector from a phenomenological point of view. As we have already seen, an important parameter in a photodetector is the time $\Delta T$ over which the intensity fluctuations remain relatively constant. This is related to the bandwidth $B$ of the optical signal by $\Delta T < 1/2B$. When an electron is released from the detecting surface and flows through the ensuing circuitry, there is always the fixed electron charge $e$. This fixes the area of the resulting current pulse. Hence higher energy electrons will flow faster, and the current pulses will be narrower in time, resulting in an increased frequency response of the detector.

Generally, one thinks of counting circuitry as literally counting each of these events. On the other hand, one can also consider the following viewpoint: suppose we "match" the detector response $B_d$ to the incident radiation bandwidth, $\Delta T \approx 1/2B = 1/2B_d$. Then each current pulse created will be approximately $\Delta T$ seconds wide. Hence at any time $t_j$, the effects of all pulses from the previous $\Delta T$ seconds will still be present. Therefore, if $k_j$ electrons flow in the interval $(t_j - \Delta T, t_j)$, then at the time $t_j$ the value of the current can be approximated by $k_j(e/\Delta T)$, or, since $k_j \approx \Delta T\,\alpha I(t_j)$, $i(t_j) \approx \alpha e I(t_j)$, which was shown earlier. If the response of the detector were square pulses, this description would be exact. On the other hand, the distortions occurring due to end effects are the normal effects of filtering. The so-called shot noise represents the fact that $k_j$ is an integer, making $i(t_j)$ take on discrete values, whereas the true $I(t)$ would be continuous.
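
The phenomenological picture above (a matched detector whose output at each sampling instant is the count in the preceding interval times e divided by the interval width) can be illustrated with a short simulation. This is a sketch under stated assumptions: the count rate passed in stands for the product of the detector constant and the intensity, the Poisson sampler is Knuth's multiplication method (adequate only for small means), and all function names are hypothetical.

```python
import math
import random

E_CHARGE = 1.602176634e-19  # electron charge in coulombs


def poisson_sample(mean, rng):
    """Draw one Poisson variate (Knuth's method, small means only)."""
    limit = math.exp(-mean)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= limit:
            return k
        k += 1


def sampled_current(rate, dT, n_slots, rng):
    """Current samples k_j * e / dT for a constant count rate.

    rate : average photoelectron rate in counts/s (assumed constant),
    dT   : sampling interval in seconds, n_slots : number of samples.
    """
    return [poisson_sample(rate * dT, rng) * E_CHARGE / dT
            for _ in range(n_slots)]


if __name__ == "__main__":
    rng = random.Random(1)
    rate, dT = 5e9, 1e-9          # five counts per interval on average
    cur = sampled_current(rate, dT, 20000, rng)
    print("mean current:", sum(cur) / len(cur))
    print("rate * e    :", rate * E_CHARGE)
```

Each sample is an integer multiple of e/dT, which is the discreteness the text identifies as shot noise, while the sample mean approaches the rate times e.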

The previous argument was intended to justify consideration of the $(2B\tau + 1)$ Nyquist samples for analog processes also. It was shown in (55) that these samples can also be considered statistically independent.

1) Maximizing Signal-to-Noise Ratio for Direct Detection: For maximum likelihood detection, the optimum form of processing consisted of weighting the counts on each of the $(2B\tau + 1)$ intervals. We will, therefore, consider the form of processing where each $k_j$ is weighted by the number $\beta_j$. The processed signal then becomes $v$, where

$$v = \sum_{j=0}^{2B\tau} \beta_j k_j. \tag{67}$$

As a criterion for signal processing, we will use the signal-to-noise ratio defined as

$$\frac{S}{N} = \frac{E^2[v \mid N_0 = 0]}{\operatorname{var}[v]}. \tag{68}$$

Thus the mean of $v$ in the absence of noise can be obtained from (37) and is

$$E[v \mid N_0 = 0] = \alpha \sum_{j=0}^{2B\tau} \beta_j |s_j(j\Delta T)|^2 \Delta T \tag{69}$$

with the variance being

$$\operatorname{var}[v] = \alpha \sum_{j=0}^{2B\tau} \beta_j^2 \{(|s_j(j\Delta T)|^2 + N_0') + \alpha (N_0'^2 + 2N_0'|s_j(j\Delta T)|^2)\Delta T\}\Delta T \tag{70}$$

and $N_0' = N_0/\Delta T$.

Thus the signal-to-noise ratio becomes

$$\frac{S}{N} = \frac{\left[\alpha \sum_{j=0}^{2B\tau} \beta_j |s_j(j\Delta T)|^2 \Delta T\right]^2}{\alpha \sum_{j=0}^{2B\tau} \beta_j^2 \{(|s_j(j\Delta T)|^2 + N_0') + \alpha (N_0'^2 + 2N_0'|s_j(j\Delta T)|^2)\Delta T\}\Delta T} \tag{71}$$

which can be bounded by using the Schwarz inequality. Hence

$$\frac{S}{N} \le \sum_{j=0}^{2B\tau} \frac{\alpha (|s_j(j\Delta T)|^2)^2 \Delta T}{|s_j(j\Delta T)|^2 + N_0' + \alpha (N_0'^2 + 2N_0'|s_j(j\Delta T)|^2)\Delta T} \tag{72}$$

with the equality holding when

$$\beta_j = \frac{|s_j(j\Delta T)|^2}{|s_j(j\Delta T)|^2 + N_0' + \alpha (N_0'^2 + 2N_0'|s_j(j\Delta T)|^2)\Delta T}. \tag{73}$$

Notice that in the absence of noise ($N_0 = 0$)

$$\frac{S}{N} = \alpha \sum_{j=0}^{2B\tau} |s_j(j\Delta T)|^2 \Delta T = \alpha E_s. \tag{74}$$

This, however, is the average number of photoelectron counts in the $(0, \tau)$ interval and is generally referred to as the quantum-limited signal-to-noise ratio.

Let us now rewrite the right-hand side of (72) as

$$\frac{S}{N} \le \sum_{j=0}^{2B\tau} \frac{\alpha |s_j(j\Delta T)|^2 \Delta T}{1 + \alpha N_0 + [1/\alpha |s_j(j\Delta T)|^2 \Delta T][\alpha N_0 + \alpha^2 N_0^2]} \le \alpha E_s. \tag{75}$$

Recall now that $\alpha N_0$ is the number of noise counts per $\Delta T$ interval and for thermal noise sources is much less than one. In addition, $\alpha |s_j(j\Delta T)|^2 \Delta T$ is the average number of signal counts in the $j$th $\Delta T$ interval. Suppose, therefore, that we construct a signal

$$|s_j(j\Delta T)|^2 = \begin{cases} E_s/\Delta T, & \text{for one value of } j \\ 0, & \text{for all other values of } j. \end{cases}$$

Then clearly

$$\sum_{j=0}^{2B\tau} |s_j(j\Delta T)|^2 \Delta T = E_s$$

is not violated, and in addition

$$\frac{S}{N} = \alpha E_s \Big/ \left[1 + \alpha N_0 + \frac{\alpha N_0 + (\alpha N_0)^2}{\alpha E_s}\right] \tag{76}$$

for all values of $\alpha E_s > \alpha N_0$. Thus low duty-cycle operation is preferable when maximizing the signal-to-noise ratio of detected radiation in the absence of detector noise.
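
The weighting argument of (67) through (73) can be checked numerically: for any weights the ratio in (71) stays below the Schwarz bound (72), the weights of (73) attain it, and concentrating the same energy in one interval raises the bound. The sketch below is a plain-Python illustration; the function names, the sample intensities, and the parameter values are assumptions chosen only to exercise the formulas.

```python
def _denominator(s, alpha, N0, dT):
    """Per-interval variance factor from (70), with N0' = N0 / dT."""
    n0p = N0 / dT
    return (s + n0p) + alpha * (n0p ** 2 + 2.0 * n0p * s) * dT


def snr(beta, s2, alpha, N0, dT):
    """Signal-to-noise ratio (71) for weights beta and intensities s2."""
    mean = alpha * sum(b * s * dT for b, s in zip(beta, s2))
    var = alpha * sum(b * b * _denominator(s, alpha, N0, dT) * dT
                      for b, s in zip(beta, s2))
    return mean ** 2 / var


def optimal_weights(s2, alpha, N0, dT):
    """Weights of (73), which achieve equality in (72)."""
    return [s / _denominator(s, alpha, N0, dT) for s in s2]


def snr_bound(s2, alpha, N0, dT):
    """Right-hand side of the Schwarz bound (72)."""
    return sum(alpha * s * s * dT / _denominator(s, alpha, N0, dT)
               for s in s2)


if __name__ == "__main__":
    s2 = [1.0, 4.0, 0.0, 2.0]          # |s_j|^2 samples (illustrative)
    alpha, N0, dT = 2.0, 0.1, 0.5      # illustrative values only
    E_s = sum(s2) * dT
    pulsed = [E_s / dT, 0.0, 0.0, 0.0]  # same energy in one interval
    print("uniform weights:", snr([1.0] * 4, s2, alpha, N0, dT))
    print("optimal weights:", snr(optimal_weights(s2, alpha, N0, dT),
                                  s2, alpha, N0, dT))
    print("bound (72)     :", snr_bound(s2, alpha, N0, dT))
    print("pulsed bound   :", snr_bound(pulsed, alpha, N0, dT))
    print("alpha * E_s    :", alpha * E_s)
```

The pulsed bound exceeds the spread-energy bound because each term of (72) is a convex function of the interval energy that vanishes at zero, which is the numerical counterpart of the low duty-cycle conclusion.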

The addition of independent thermal noise with temperature $T$ at the detector output changes the variance in (70). After some manipulation to take into account the electron charge $e$, the bandwidth, and the load $R$, the signal-to-noise ratio in (76) can be written as

$$\frac{S}{N} = \alpha E_s \Big/ \left[1 + \alpha N_0 + \frac{\alpha N_0 + (\alpha N_0)^2}{\alpha E_s} + \frac{kT}{e^2 \alpha E_s R B_d}\right]. \tag{77}$$

The quantity $kT/e^2 \alpha E_s R B_d$ is, in general, much greater than one. Therefore, except under extreme conditions of temperature, impedance, bandwidth, and signal level, a normal detector will be "thermal noise limited" in operation and $S/N$ will be much less than $\alpha E_s$.

We have been considering the case where each sampling interval represented one mode. If, in fact, each interval contained $L$ modes, then clearly we need only replace $\alpha N_0$ by $L\alpha N_0$ everywhere.

2) Direct Detection with Photomultiplication: We have just shown that most detectors are inherently thermal noise limited except under extreme temperature conditions. This was true because the current that was released by the surface immediately encountered a thermal environment. There are devices, presently limited to the visible region of the spectrum, which impart a preamplification to the photocurrent before the thermal environment is met. The most common device, a photomultiplier tube, consists of a cascade of stages through which each emitted electron passes and is amplified many thousands of times. When the effect of an electron emitted at the cathode reaches the anode, it appears as an actual current pulse well above the anode thermal environment. It is, therefore, possible to view the effects of individual electrons. These devices are commonly referred to as "photon counters."

To first order, one can account for this amplification $A$ by assuming an electron charge equal to $Ae$. Then we can see from (77) that the term which previously made the device thermal noise limited becomes

$$\frac{kT}{A^2 e^2 \alpha E_s R B_d}.$$

Thus if the gain of the device is such that the inequality

$$A > \frac{1}{e}\sqrt{\frac{kT}{(\alpha E_s) R B_d}}$$

is satisfied, it is again shot-noise limited. In practice, the gain is a random variable and an "excess noise" appears because of the finite variance of $A$. This, however, only causes changes on the order of 20 percent or about 1 dB, and for the purposes of this discussion can be ignored.

3) Heterodyne Detection: If the electric field of a local oscillator is aligned coincident with the received signal over the detector surface, then one can directly add the two electric fields. Thus if we designate the signal by $E_1 \exp\{i[\omega_1 t + \phi(t)]\}$ and the local oscillator by $E_{LO} \exp(i\omega_2 t)$, then

$$s_j(j\Delta T) = E_1 \exp\{i[\omega_1 (j\Delta T) + \phi(j\Delta T)]\} + E_{LO} \exp[i\omega_2 (j\Delta T)]$$

and

$$|s_j(j\Delta T)|^2 = |E_1|^2 + |E_{LO}|^2 + 2|E_1||E_{LO}| \cos\{(\omega_1 - \omega_2) j\Delta T + \phi(j\Delta T)\}. \tag{78}$$

If the local oscillator is made large, then it can be shown that, under these conditions, the density in (41) approaches a Gaussian density with a mean value of $2\alpha |E_1||E_{LO}| \cos\{(\omega_1 - \omega_2) j\Delta T + \phi(j\Delta T)\}$ (excluding the dc component) and variance $\alpha |E_{LO}|^2$ multiplied by the bandwidth considered. Then if the bandwidth of the signal in (78) is $2W$ and the bandpass of the detector is greater than $(\omega_1 - \omega_2) + W$, one can pass the detected signal through a bandpass filter centered at $(\omega_1 - \omega_2)$ with bandpass $2W$ and recreate the signal $2\alpha E_1 E_{LO} \cos\{(\omega_1 - \omega_2) j\Delta T + \phi(j\Delta T)\}$. The resulting carrier signal-to-noise ratio will be

$$\frac{S}{N} = \frac{\tfrac{1}{2}(2\alpha |E_1||E_{LO}|)^2}{\alpha |E_{LO}|^2\, 2W} = \frac{\alpha |E_1|^2}{W} = \frac{\eta |E_1|^2}{h\nu W}$$

which can again be recognized as the quantum limited condition.
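
A quick numerical check makes the key property of the heterodyne result explicit: the local oscillator amplitude cancels. The sketch reads the carrier signal-to-noise ratio as one half of the squared carrier amplitude, (2 alpha |E1| |ELO|)^2 / 2, divided by the noise variance alpha |ELO|^2 times the bandwidth 2W; that reading of the garbled display, the function name, and the numerical values are all assumptions.

```python
def heterodyne_carrier_snr(alpha, e1, e_lo, W):
    """Carrier S/N after heterodyning with a strong local oscillator.

    alpha : detector constant, e1 : signal field amplitude,
    e_lo  : local oscillator field amplitude, W : half-bandwidth.
    """
    carrier_power = 0.5 * (2.0 * alpha * e1 * e_lo) ** 2
    noise_power = alpha * e_lo ** 2 * 2.0 * W
    return carrier_power / noise_power


if __name__ == "__main__":
    alpha, e1, W = 3.0, 0.2, 5.0       # illustrative values only
    for e_lo in (10.0, 100.0, 1000.0):
        print(e_lo, heterodyne_carrier_snr(alpha, e1, e_lo, W))
    print("alpha * e1**2 / W =", alpha * e1 ** 2 / W)
```

Every local oscillator level gives the same ratio, alpha |E1|^2 / W, so the result depends only on the received signal, which is why it is identified with the quantum limited condition.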

4) Power Spectrum Analysis: In Section III it was shown that the time-averaged power density spectrum of the current could be written as

$$S_I(\omega) = |H(\omega)|^2 [E(N) + S_N(\omega)].$$

Since $S_N(\omega)$ is the spectrum of a nonnegative definite function (the normalized power), it can be written in terms of a dc and an ac component. The ac component is $S_{AC}(\omega)$, where

$$S_{AC}(\omega) = (\eta \bar{n} e)^2 \Phi_M(\omega)$$

and $n(t)$ has been normalized to

$$n(t) = \bar{n}(1 + m(t)); \qquad m(t) \ge -1$$

with

$$\int m(t)\,dt = 0$$

and $\Phi_M(\omega)$ the time-average power density spectrum of $m(t)$. Notice that the modulation index is included in $m(t)$. For an unmodulated source, such as noise, $m(t) = 0$, and only the shot noise term and the dc remain. Thus if we have a signal plus additive noise impinging on the detector, where the average noise rate is designated $n_n$, the power density spectrum minus the dc terms can be written as

$$S_T(\omega) = e^2 |H(\omega)|^2 [\eta (n_n + \bar{n}) + (\eta \bar{n})^2 \Phi_M(\omega)] + \frac{2kT}{R}$$

where we have also included the thermal noise contribution. If we define the signal-to-noise ratio as the ratio of the total signal power

$$\frac{(e\eta \bar{n})^2}{2\pi} \int |H(\omega)|^2 \Phi_M(\omega)\,d\omega$$

over the bandwidth of the signal, divided by the total nonsignal power over the same bandwidth, then, assuming that $|H(\omega)|^2$ is "flat" over the $2W$ region of interest,

(79)

where $W$ is now the cyclic frequency. Notice again that if $e$ is replaced by $Ae$ and the shot noise term $2\eta A^2 e^2 \bar{n} W = 2\eta A e I_{DC} W > 4kTW$, the device will be again shot-noise limited. The term

$$\frac{\eta \bar{n}}{2W} = \frac{\eta P}{2h\nu W}$$

can again be recognized as being related to the quantum limited condition.


VI. Summary Remarks 

We have tried to present in this paper a review of the 
basic concepts in optical communications viewed strictly 
from a classical point of view, in the absence of any channel 
effects. In this vein, we have viewed the received signal as 
an electromagnetic field and described its interaction with 
a photodetector. We then described some of the fundamen- 
tal properties of the resulting current flow as seen by the 
communications engineer. 

The treatment in this paper is not complete, since the study of this problem has not finished. Consequently, some portions have been given more emphasis than others, while some have been omitted entirely. For example, in the literature the topic of continuous estimation for shot noise processes has barely been touched [13]. The same is true for synchronization in a shot noise environment [31], although this will be fundamental to any sophisticated optical communications system.

What has been attempted, rather, was a presentation 
which answered the questions concerning the physical 
modelling of the system and a reduction to the terms most 
useful for analysis. Where such analysis had reached a level 
of conveying a reasonably complete understanding of an 
aspect of the problem, it was also presented. It is hoped that 
this paper is thorough enough to motivate additional re- 
search in this area. 


References 

[1] R. J. Glauber, "The quantum theory of optical coherence," Phys. Rev., vol. 130, pp. 2529-2539, June 1963.

[2] R. J. Glauber, "Coherent and incoherent states of the radiation field," Phys. Rev., vol. 131, September 1963.

[3] R. J. Glauber, "Optical coherence and photon statistics," in Quantum Optics and Electronics, C. de Witt et al., Eds. New York: Gordon and Breach, 1965, pp. 65-185.

[4] L. Mandel and E. Wolf, "Coherence properties of optical fields," Rev. Mod. Phys., vol. 37, pp. 231-287, April 1965.

[5] J. R. Klauder and E. C. G. Sudarshan, Fundamentals of Quantum Optics. New York: W. A. Benjamin, Inc., 1968.

[6] G. J. Troup, Optical Coherence Theory. London, England: Methuen, 1967.

[7] W. H. Louisell, Radiation and Noise in Quantum Electronics. New York: McGraw-Hill, 1964.

[8] L. Mandel, E. C. G. Sudarshan, and E. Wolf, Proc. Phys. Soc., vol. 84, 1964.

[9] E. Fermi, Rev. Mod. Phys., vol. 4, 1932.

[10] U. Fano, Amer. J. Phys., vol. 29, 1961.

[11] H. L. Van Trees, Detection, Estimation, and Modulation Theory, Part I. New York: Wiley, 1968.

[12] S. O. Rice, "Mathematical analysis of random noise," Bell Syst. Tech. J., vol. 23, pp. 282-332, 1944.

[13] S. Karp and J. R. Clark, "Photon counting: a problem in classical noise theory," IEEE Trans. Inform. Theory, vol. IT-16, pp. 672-680, November 1970.

[14] D. Slepian and H. O. Pollak, "Prolate spheroidal wave functions, Fourier analysis, and uncertainty, I," Bell Syst. Tech. J., vol. 40, pp. 43-64, 1961.

[15] H. J. Landau and H. O. Pollak, "Prolate spheroidal wave functions, Fourier analysis and uncertainty, II," Bell Syst. Tech. J., vol. 40, pp. 65-84, 1961.

[16] H. J. Landau and H. O. Pollak, "Prolate spheroidal wave functions, Fourier analysis and uncertainty, III," Bell Syst. Tech. J., vol. 41, pp. 1295-1336, 1962.

[17] B. Reiffen and H. Sherman, "An optimum demodulator for Poisson processes: photon source detectors," Proc. IEEE, vol. 51, pp. 1316-1320, October 1963.

[18] K. Abend, "Optimum photon detection," IEEE Trans. Inform. Theory (Correspondence), vol. IT-12, pp. 64-65, January 1966.


1626 


PROCEEDINGS OF THE IEEE, VOL. 58, NO. 10, OCTOBER 1970


[19] T. Kailath, "The divergence and Bhattacharyya distance measures in signal selection," IEEE Trans. Commun. Technol., vol. COM-15, pp. 52-60, February 1967.

[20] R. M. Gagliardi and S. Karp, "M-ary Poisson detection and optical communications," IEEE Trans. Commun. Technol., vol. COM-17, pp. 208-216, April 1969.

[21] D. Gabor, "Theory of communication," J. Inst. Elec. Eng. (London), vol. 93 (III), pp. 429-457, November 1946.

[22] D. Middleton, Introduction to Statistical Communication Theory. New York: McGraw-Hill, 1960.

[23] E. Parzen, Stochastic Processes. San Francisco, Calif.: Holden-Day, Inc., 1962.

[24] A. Papoulis, Probability, Random Variables, and Stochastic Processes. New York: McGraw-Hill, 1965.

[25] Ibid., ch. 10.

[26] S. Karp and R. M. Gagliardi, "On the representation of a continuous stochastic intensity by Poisson shot noise," IEEE Trans. Inform. Theory, vol. IT-16, pp. 142-147, March 1970.

[27] S. Karp, R. M. Gagliardi, and I. S. Reed, "Radiation models using discrete radiator ensembles," Proc. IEEE, vol. 56, pp. 1704-1711, October 1968.

[28] D. Middleton, "A statistical theory of reverberation and similar first-order scattered fields, Part I: Waveforms and the general process; Part II: Moments, spectra, and special distributions," IEEE Trans. Inform. Theory, vol. IT-13, pp. 372-414, July 1967.

[29] R. W. Chang, "Photon detection for an optical pulse code modulation system," IEEE Trans. Inform. Theory (Correspondence), vol. IT-15, pp. 725-728, November 1969.

[30] I. Bar-David, "Communication under the Poisson regime," IEEE Trans. Inform. Theory, vol. IT-15, pp. 31-37, January 1969.

[31] R. M. Gagliardi, "The study of synchronization techniques for optical communication systems," University of Southern California, Elec. Eng. Dept., 2nd, 3rd, and 4th Quarterly Reps., April, August, and December 1969.

[32] W. K. Pratt, "Binary detection in an optical polarization modulation communication channel," IEEE Trans. Commun. Technol. (Concise Papers), vol. COM-14, pp. 664-665, October 1966.

[33] M. Ross, Laser Receivers. New York: Wiley, 1966.

[34] J. W. S. Liu, "Reliability of quantum-mechanical communication systems," IEEE Trans. Inform. Theory, vol. IT-16, pp. 319-329, May 1970.

[35] S. Karp et al., "Error probabilities for Poisson detection," NASA Tech. Note TN-D4721, October 1968.

[36] M. Ross, "Pulse interval modulation laser communications," presented at the Eastcon Conv., Washington, D. C., October 1967.

[37] S. Karp and R. M. Gagliardi, "A low duty cycle optical communication system," presented at the Eastcon Conv., Washington, D. C., October 1967.

[38] S. Karp and R. M. Gagliardi, "The design of a pulse-position modulated optical communication system," IEEE Trans. Commun. Technol., vol. COM-17, pp. 670-676, December 1969.

[39] R. W. Sanders, "Communication efficiency comparison of several communication systems," Proc. IRE, vol. 48, pp. 575-588, April 1960.

[40] S. Karp, "Communication efficiency of quantum systems," NASA Tech. Rep. TR-R-R-320, September 1969.



670 IEEE TRANSACTIONS ON COMMUNICATION TECHNOLOGY, VOL. COM-17, NO. 6, DECEMBER 1969 

The Design of a Pulse-Position Modulated Optical Communication System

SHERMAN KARP, MEMBER, IEEE, AND ROBERT M. GAGLIARDI, MEMBER, IEEE


REFERENCE: Karp, S., and Gagliardi, R. M.: THE DESIGN OF 
A PULSE-POSITION MODULATED OPTICAL COMMUNICA- 
TION SYSTEM, NASA Electronics Research Center, Cambridge, 
Mass. 02139, and University of Southern California, Los Angeles, 
Calif. 90007. Rec’d 7/30/69; revised 5 9 69. Paper 69TP47-COM, 
approved by the IEEE Radio Communication Committee for pub- 
lication without oral presentation. IEEE TRANS. ON COMMUNI- 
CATION TECHNOLOGY, 17-6, December 1969, pp. 670-676. 

ABSTRACT: In recent literature the advantages of an idealized 
narrow-width pulse-position modulated (PPM) optical communi- 
cation system, using coherent sources and direct photodetection, 
have been shown. In this paper the practical design of such an oper- 
ating PPM link is considered. System performance in terms of 
error probabilities and information rates is derived in terms of 
key parameters, such as power levels, number of PPM signals, 
pulse width, and bandwidths. Both background radiation and 
receiver thermal noise are included. Design procedures utilizing 
this data are outlined. Whenever possible, optimal design values 
and parameter tradeoffs, in terms of maximizing information rate 
or minimizing transmitter power, are shown. The effect on perform- 
ance of photomultipliers and their inherent statistics is also pre- 
sented. Although the basic analysis is derived in terms of photon 
“counts,” the necessary system optics equations are introduced to 
allow for overall optical hardware design. The primary underlying 
assumption is that synchronization is maintained at all times 
between the transmitter and receiver. 

Introduction 

WITH THE development of coherent sources in the optical region
of the spectrum, there has been an increasing interest in the
design of optical communication systems. The direct detection of
optical radiation is presently restricted to photodetection
surfaces, for which it has been shown that the released electrons
obey Poisson statistics [7], [8]. In this case investigators have
shown the advantages of using narrow-width pulse-position
modulation (PPM) as the principal mode of communication [3], i.e.,
coding information into one of M possible signals and transmitting
it as a pulse of optical energy placed in one of M adjacent time
intervals. It has been shown [3] that idealized versions of such
systems optimize performance in terms of both various “distance”
criteria and overall error probability, for cases of most interest.
Therefore, in this paper we shall present procedures for the
practical design of an M-ary low “duty cycle” PPM optical
communication system. In particular, performance characteristics
in terms of key system parameters will be derived, with emphasis
on hardware limitations and interference effects.

Consider the PPM optical communication system shown 
in Fig. 1. The transmitter is a monochromatic optical 
source operating at a fixed frequency. Information is sent 


Fig. 1. Maximum-likelihood processor (block diagram: photodetector
followed by a current integrator, with background radiation noise
entering the photodetector).

by transmitting one of M signals as a pulse of optical energy at
the same frequency, located in one of M adjacent time intervals,
each of which is ΔT seconds wide. We assume that complete
synchronization is maintained between the transmitter and receiver
at all times, i.e., time coherent operation. The optical receiver
detects the transmitted signal by attempting to determine the
optical energy in each possible time slot, then selecting the
signal which corresponds to the maximal energy. In direct
photodetection this is equivalent to “counting” the number of
released electrons in each ΔT interval. Background radiation
entering the photodetector acts as erroneous energy, causing
signaling errors. In practical systems photomultipliers are often
used to afford an improvement in photodetection (i.e., a gain in
numbers of released electrons) but unfortunately often behave
randomly, complicating system design. In addition, additive thermal
noise may occur after photodetection, which tends to cause further
errors in signal decisioning. Both of these latter effects shall be
considered subsequently.

As optical radiation impinges upon a photodetecting surface, a
series of electrons is released; each produces a current pulse
e h(t − t_m), where e is the electron charge, t_m is the time of
release, and h(t) represents the current motion. The function h(t)
is pulse like, having a time width roughly equal to the inverse of
the photodetector bandwidth. We assume that h(t) is identical for
all electrons, and that the area of h(t) is normalized to unity.
In the absence of thermal noise the output voltage across a
resistor R of the normalized current integrator, when sampled after
ΔT seconds of integration, is then

\[ v = (Ke/\Delta T)R \tag{1} \]

where K is the number of electrons released during the ΔT-second
integration period. This result neglects “end effects;” that is, it
assumes that h(t) can be considered an “impulse” function with
respect to the integrating time ΔT. Thus the integrator sample is
proportional to the number of electrons released in the preceding
ΔT time interval and therefore “counts” electrons. The average
number of electrons produced in the time interval ΔT as a result of
the received radiation from the transmitter is denoted K_s and is
proportional to the received transmitter energy, i.e.,

\[ K_s = \eta P_R \Delta T / hf \tag{2} \]

where η is the photodetector efficiency, P_R the received peak
optical signal power in watts, and f the mode (transmitter)
frequency, and where h = 6.6 × 10^-34 joule-seconds. The received
power can be related to the system optics by

\[ P_R = (d/\theta r)^2 P_T L \times 10^{-4} \tag{3} \]

where d is the optical diameter in centimeters, r the range in
meters, θ the divergence of the antenna in radians, P_T the average
transmitter power, and L the transmitting optics loss. The peak
transmitter power can be converted to the average transmitter power
by dividing it by the number of signaling intervals M.

Similarly, we denote by K_n the average number of electrons in a
time interval ΔT produced by the background radiation received by
the optical collector. Thus

\[ K_n = \eta P_N \Delta T / hf \tag{4} \]

where now P_N is the received background radiation average power.
This is generally written as

\[ P_N = N_\lambda A_R L_a \Omega \Delta\lambda \tag{5} \]

where N_λ is the background spectral radiance [watts per area
(solid angle) bandwidth], L_a the optical loss, A_R the area of the
collector, Ω the resolution of the receiver (solid angle), and Δλ
the optical bandwidth.

Note that with P_N held constant, the average number of noise
electrons K_n is proportional to ΔT. This clearly indicates the
advantage gained by low “duty cycle” operation, i.e., by the use of
signal intervals which are as narrow as possible to decrease the
amount of interfering radiation. The minimum value for ΔT, however,
is approximately 1/Δλ, for then the assumption of fixed noise power
P_N is no longer valid. (For optical filters on the order of 5 Å,
ΔT widths of 10^-11 to 10^-13 seconds are feasible.)

The number of electrons counted in a signaling interval (i.e., an
interval ΔT containing transmitter energy) is a Poisson random
variable whose average value is K_s + K_n. For nonsignaling
intervals the average value is K_n. We have tacitly assumed that
the system is synchronized, i.e., that the integration occurs
exactly during the ΔT data intervals. The maximum-likelihood
processing corresponds to counting the number of electrons in each
of the M intervals and selecting the interval with the largest
count as the proper PPM signal. Allowing for likelihood draws (in
which case we make a random selection among the drawees), the
probability of making a correct decision is

\[ P_D = \sum_{r=0}^{M-1} \frac{1}{r+1} \Pr\{\text{correct interval count equals } r \text{ other interval counts and exceeds the remaining } M-r-1\}, \]

i.e., each term weights the probability that the correct interval
count equals r other interval counts and exceeds the remaining
M − r − 1 by the chance 1/(r + 1) of winning the random selection.

Temporarily neglecting the additive thermal noise and taking into
account all the ways in which the correct interval count can equal
r other interval counts, we have

\[ P_D = \sum_{k=1}^{\infty} \frac{(K_s+K_n)^k}{k!} \exp[-(K_s+K_n)] \sum_{r=0}^{M-1} \frac{1}{r+1} \binom{M-1}{r} \left[\frac{K_n^k}{k!}\exp(-K_n)\right]^{r} \left[\sum_{j=0}^{k-1} \frac{K_n^j}{j!}\exp(-K_n)\right]^{M-1-r} + (1/M)\exp[-(K_s + MK_n)]. \tag{6} \]

Using the identity

\[ \sum_{r=0}^{M-1} \frac{(M-1)!}{(r+1)!\,(M-1-r)!} \left(\frac{B}{A}\right)^{r} = \frac{(1 + B/A)^{M} - 1}{M(B/A)} \]

we can rewrite (6) as

\[ P_D = \sum_{k=1}^{\infty} \frac{(K_s+K_n)^k}{k!} \exp[-(K_s+K_n)] \left[\sum_{j=0}^{k-1} \frac{K_n^j}{j!}\exp(-K_n)\right]^{M-1} \left[\frac{(1+\alpha)^{M} - 1}{M\alpha}\right] + (1/M)\exp[-(K_s + MK_n)] \tag{7} \]

where

\[ \alpha = \frac{K_n^k/k!}{\sum_{j=0}^{k-1} K_n^j/j!}. \]

This result is amenable to computation and can be used in system
design to obtain performance characteristics for M-ary operation
with fixed parameter values. An exemplary plot is shown in Fig. 2,
in which error probability P_E = 1 − P_D is plotted versus M for
fixed values of K_s and K_n. The results show the degradation in
system performance as M is increased, which can be attributed to
the increase in the likelihood draws as the number of intervals
increases. Note that P_E depends upon both K_s and K_n and not
simply upon their ratio, so that a complete catalog of P_E curves
is required to handle all design conditions [11].

Previously, it was stated that the signal intervals ΔT should be as
narrow as the optical bandwidths allow. This fact can be shown
quantitatively by examining P_E as a function of ΔT, assuming P_N
fixed. This is shown, for example, in Fig. 3, where M = 2, and
where P_N is constrained such that the average electron noise count
in an interval T_0 is 10. The probability of error is plotted
versus ΔT/T_0, the duty cycle of the transmitter. Since K_s is held
fixed throughout each curve, the transmitter peak power must
necessarily increase as ΔT decreases, as is obvious from (2). Note
that the error probability decreases monotonically as ΔT decreases.
The minimum values at ΔT = 0 are shown, but operation at these
values implies infinite optical bandwidth. This minimum value is
precisely the probability that a zero count occurs in the signaling
interval (which is also the noise count of the nonsignaling
intervals), and that the receiver randomly selects incorrectly. It
is also interesting to note that the pulse width ΔT is not
particularly significant at low values of K_s. As K_s increases,
however, the duty cycle begins to play a paramount role in the
resulting error probability. Thus attempts to increase bandwidth
will have a direct payoff in system operation.

Fig. 2. Error probability versus M.

Fig. 3. Error probability versus normalized pulse width.
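As a computational illustration of (7), the following Python sketch evaluates P_D and sweeps the duty cycle for the setting described for Fig. 3 (M = 2, ten noise counts per interval T_0). It is a minimal sketch rather than the authors' program: the function names, the truncation rule for the infinite sum, and the particular duty-cycle values are assumptions made here for illustration.

```python
import math


def ppm_correct_probability(ks, kn, m, kmax=None):
    """P_D of (7): M-ary PPM, Poisson counts, random selection among draws."""
    mean_sig = ks + kn
    if kmax is None:
        # Assumed truncation point, far into the Poisson tail.
        kmax = int(mean_sig + 12.0 * math.sqrt(mean_sig) + 30)
    sig_pmf = math.exp(-mean_sig)   # P(signal-slot count = 0)
    noise_pmf = math.exp(-kn)       # P(noise-slot count = 0)
    noise_cdf = 0.0                 # becomes P(noise-slot count <= k - 1)
    total = math.exp(-(ks + m * kn)) / m   # every slot empty, random guess
    for k in range(1, kmax + 1):
        noise_cdf += noise_pmf      # add P(noise-slot count = k - 1)
        sig_pmf *= mean_sig / k     # P(signal-slot count = k)
        noise_pmf *= kn / k         # P(noise-slot count = k)
        if noise_pmf > 0.0:
            alpha = noise_pmf / noise_cdf
            # [(1 + alpha)^M - 1] / (M alpha), written to avoid cancellation
            ties = math.expm1(m * math.log1p(alpha)) / (m * alpha)
        else:
            ties = 1.0              # limit of the bracket as alpha -> 0
        total += sig_pmf * noise_cdf ** (m - 1) * ties
    return total


def duty_cycle_sweep(ks, noise_in_t0=10.0, m=2,
                     duty_cycles=(1.0, 0.5, 0.1, 0.01, 0.0)):
    """P_E versus duty cycle dT/T0 with the background power held fixed."""
    return [(d, 1.0 - ppm_correct_probability(ks, noise_in_t0 * d, m))
            for d in duty_cycles]


if __name__ == "__main__":
    for duty, pe in duty_cycle_sweep(ks=10.0):
        print(f"dT/T0 = {duty:5.2f}   P_E = {pe:.3e}")
```

With K_n = 0 the function reduces to 1 − [(M − 1)/M] exp(−K_s), which for M = 2 is the zero-width limit discussed above.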

It is generally tempting for communication engineers to base system
design on signal-to-noise ratios (SNR). Typically, this is defined
as the ratio of the squared average signal electron count to the
variance of the count with background noise [5]. In terms of our
previous notation this becomes

\[ S/N = K_s^2/(K_s + K_n). \tag{8} \]



Fig. 4. Probability of error for fixed SNR versus 
noise background. 



Fig. 5. Probability of error for ideal photomultiplier. 



Fig. 6. Probability of error for photomultiplier model. 


To indicate the difficulty in basing design purely upon the SNR,
consider the following comparison. First, let K_s = 10 and
K_n = 10; then S/N = 5 and, from Fig. 2, P_E = 3 × 10^-2. On the
other hand, consider the case when K_s = 5 and K_n = 0. Again
S/N = 5, but now P_E = 3 × 10^-3. Thus with less signal energy we
have improved the detection an order of magnitude with the same
SNR. This is due to the fact that P_E in (7) depends upon signal
energy K_s and noise energy K_n and not only upon the ratio. It is
precisely this point that distinguishes the Poisson detection
problem from the analogous problem of transmitting one of M
orthogonal signals over a Gaussian additive channel of equal noise
power.

It is also of interest to determine how the SNR and error
probability are related, in general. By solving for K_s in (8) one
obtains

\[ K_s = \tfrac{1}{2}(S/N)\{1 + [1 + 4K_n/(S/N)]^{1/2}\} \tag{9} \]

which can be inserted into (7). We can then plot P_E versus K_n for
fixed values of S/N and M, as shown in Fig. 4. For large values of
K_n, P_E approaches erf(S/2N), which is identical to the error
probability of orthogonal coherent signaling in Gaussian noise with
a SNR of S/N. Since this approach is asymptotic from below, the
Gaussian case will always be inferior to the Poisson case for the
same SNR. This point is also in agreement with results showing the
equivalence of optimum Poisson processing and optimum Gaussian
processing for large values of background noise power [1], [4].
Notice again that to maintain the same SNR, K_s must be increased.
Lastly, we note that even if noise background is negligible, i.e.,
K_n → 0, the signaling error probability does not go to zero but
rather approaches

\[ P_E \to \tfrac{1}{2}\exp(-K_s) \quad (K_n \to 0). \tag{10} \]

This is the same as the minimum values (at ΔT = 0) shown in Fig. 3.
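The relation between (8) and (9) can be checked numerically. The short Python sketch below uses illustrative function names chosen here, not taken from the paper; it computes the SNR of (8) and inverts it with (9), which is the positive root of the quadratic K_s^2 − (S/N)K_s − (S/N)K_n = 0.

```python
import math


def snr(ks, kn):
    """S/N of (8): squared mean signal count over the count variance."""
    return ks * ks / (ks + kn)


def ks_for_snr(target_snr, kn):
    """K_s of (9): signal count needed to hold a given S/N at noise count kn."""
    return 0.5 * target_snr * (1.0 + math.sqrt(1.0 + 4.0 * kn / target_snr))


if __name__ == "__main__":
    # The two operating points compared in the text share the same S/N.
    for ks, kn in [(10.0, 10.0), (5.0, 0.0)]:
        print(f"K_s = {ks:4.1f}, K_n = {kn:4.1f}: S/N = {snr(ks, kn):.2f}")
    print("K_s needed for S/N = 5 at K_n = 10:", ks_for_snr(5.0, 10.0))
```

Both operating points give the same S/N, so any difference in their error probabilities must come from (7) rather than from the ratio itself.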

Effect of Thermal Noise and Photomultipliers 

So far in our analysis we have neglected the effect of additive
thermal noise, which adds a random variable to the integrator
output sample of (1). This complicates our original assertion that
the receiver counts exactly the number of electrons at the
photodetector output. Since this is the crux of the
maximum-likelihood direct detection system, it would be worthwhile
to investigate this problem in more detail. Suppose, for example,
that we consider the current resulting from the flow of a signal
photoelectron during a time interval ΔT = 10^-9 seconds. The sample
value at the integrator output across a 50-Ω resistor from this
electron would be 8 × 10^-9 volts. Now if the receiver operated at
room temperature, the thermal additive noise would contribute an
integrator noise voltage whose root mean square (rms) value is
approximately 28 × 10^-6 volts (R = 50 Ω, temperature = 300 K).
Clearly, the count of a single electron could not easily be made in
such a poor signal and noise condition. Photomultipliers exist,
however, which effectively amplify the current effect of each
photoelectron, resulting in a larger photoelectron count at the
integrator output. Let this amplification factor be A, so that each
photoelectron contributes a current value Ae/ΔT at the time of
sample. We would like to determine the effect of A on P_D in (7)
when Gaussian white thermal noise of one-sided spectral level N_0
is added at the integrator input. If we assume that each electron
receives the same photomultiplier gain A, then the sample value due
to the photodetector output is

\[ v = (KeA/\Delta T)R \tag{11} \]

where K is again the number of electrons produced during the
interval ΔT. With Poisson statistics for the electron count during
the ΔT interval, the probability density of v is then

\[ P(v) = \sum_{j=0}^{\infty} [(\bar{K}^j/j!)\exp(-\bar{K})]\,\delta(v - jeA\beta R) \tag{12} \]

where δ(x) is the Dirac delta function, β = 1/ΔT, and K̄ is the
average value of K. The thermal noise is integrated by the
integrator and adds to the integrator sample v a random variable
that is Gaussian distributed with zero mean and variance N_0β. Thus
the total integrator sample value z after ΔT seconds of counting
has a probability density obtained by convolving the Gaussian
density with the discrete density in (12), yielding

\[ p_z(z,\bar{K}) = \sum_{j=0}^{\infty} [(\bar{K}^j/j!)\exp(-\bar{K})]\,G(z, jAe\beta R, N_0\beta) \tag{13} \]


where G(a,b,c) denotes a Gaussian density in the variable a with
mean b and variance c. Observe that the sample probability
densities are now continuous densities, and that the probability of
equal sample values occurring is zero; that is, there is a zero
probability of likelihood draws. Now the average count K̄ is
K_s + K_n when a signal is present in the ΔT interval and is K_n
when the signal is absent. Therefore, the probability of a correct
decision is simply the probability that the observable z after the
correct interval exceeds the observable z after the M − 1 remaining
intervals. Hence

\[ P_D = \int_{-\infty}^{\infty} dz\, p_z(z, K_s + K_n) \left[\int_{-\infty}^{z} p_z(y, K_n)\,dy\right]^{M-1}. \tag{14} \]

This can be written more compactly as

\[ P_D = \sum_{r=0}^{\infty} \frac{(K_s+K_n)^r}{r!} \exp[-(K_s+K_n)] \int_{-\infty}^{\infty} dz\, G(z, r\mu, \sigma^2)\,\psi^{M-1}(z) \tag{15} \]

where

\[ \psi(z) = \frac{1}{2}\left\{1 + \sum_{j=0}^{\infty} \left[\frac{K_n^j}{j!}\exp(-K_n)\right] \operatorname{erf}\left(\frac{z - j\mu}{\sqrt{2}\,\sigma}\right)\right\} \]

\[ \mu = Ae\beta R, \qquad \sigma^2 = N_0\beta \]

and erf(x) is the error integral. Equation (15) has been evaluated
for several values of K_s and K_n and is shown in Fig. 5 with
β = 10^9 Hz and N_0 corresponding to a noise temperature of 300 K
with a 50-Ω load. The asymptotic values for large photomultiplier
gains are precisely the values obtained by (7). At low gains,
however, the thermal noise becomes the dominating source of error,
and the probability of error increases rapidly. Note that to
overcome the thermal environment a photomultiplier gain of about
10^4 is necessary for all the operating conditions shown. For other
thermal environments one can use, as a rule of thumb,
A > 600(temperature)^{1/2}, obtained by directly scaling the
foregoing results.
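The behavior described for (14) and (15) can be explored numerically. The following Python sketch integrates (14) on a grid, with the observable normalized by the single-electron step μ = AeβR so that only the ratio s = σ/μ matters. Several choices here are assumptions made for illustration and do not come from the paper: the rms thermal noise is modeled as sqrt(4kTRβ), which reproduces the 28 × 10^-6 volt figure quoted earlier, and the truncation of the Poisson sums, the grid step, and all function names are chosen here.

```python
import math

K_BOLTZMANN = 1.380649e-23    # J/K
E_CHARGE = 1.602176634e-19    # C


def noise_to_step_ratio(gain, temp_k=300.0, r_ohm=50.0, beta_hz=1e9):
    """s = sigma/mu: rms thermal noise over one amplified photoelectron step."""
    sigma = math.sqrt(4.0 * K_BOLTZMANN * temp_k * r_ohm * beta_hz)  # assumed model
    mu = gain * E_CHARGE * beta_hz * r_ohm
    return sigma / mu


def poisson_weights(mean, jmax):
    """Poisson probabilities for counts 0..jmax."""
    p = math.exp(-mean)
    weights = [p]
    for j in range(1, jmax + 1):
        p *= mean / j
        weights.append(p)
    return weights


def pd_with_thermal_noise(ks, kn, m, s, steps_per_sigma=8):
    """Grid evaluation of (14) with z measured in units of mu."""
    mean_sig = ks + kn
    jmax = int(mean_sig + 10.0 * math.sqrt(mean_sig) + 20)  # assumed truncation
    w_sig = poisson_weights(mean_sig, jmax)
    w_noise = poisson_weights(kn, jmax)
    dz = s / steps_per_sigma
    z = -8.0 * s
    z_end = jmax + 8.0 * s
    norm = 1.0 / (s * math.sqrt(2.0 * math.pi))
    root2 = math.sqrt(2.0)
    total = 0.0
    while z <= z_end:
        density = 0.0   # p_z(z, K_s + K_n), signal-slot density
        cdf = 0.0       # integral of p_z(y, K_n) up to z, noise-slot CDF
        for j in range(jmax + 1):
            u = (z - j) / s
            density += w_sig[j] * norm * math.exp(-0.5 * u * u)
            cdf += w_noise[j] * 0.5 * (1.0 + math.erf(u / root2))
        total += density * cdf ** (m - 1) * dz
        z += dz
    return total


if __name__ == "__main__":
    for gain in (1e3, 1e4, 1e5):
        s = noise_to_step_ratio(gain)
        pe = 1.0 - pd_with_thermal_noise(10.0, 1.0, 2, s)
        print(f"gain = {gain:.0e}   sigma/mu = {s:.3f}   P_E = {pe:.3e}")
```

Working in units of μ keeps the gain, load resistance, and bandwidth out of the integration itself; they enter only through the single ratio s.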

In the previous analysis we have assumed that the multiplier gain
was a constant; that is, it was the same for all photoelectrons. In
practice, however, the gain itself is generally a random variable
[10] with a variance or “spread” which is usually taken as a
percentage of the mean gain. We would now like to recompute P_D
under this situation. If we let A_i be the electron gain of the ith
released photoelectron, then the integrator sample value for K
electrons is

\[ v = \sum_{i=1}^{K} A_i e\beta R \tag{16} \]

where each A_i is an independent random variable with probability
density P_A(x). The probability density of v is then obtained from

\[ P(v) = \sum_{K=0}^{\infty} p(v/K)\,P(K) \tag{17} \]

where p(v/K) is the conditional density of v, given K
photoelectrons. Hence from (16)

\[ p(v/K) = (1/e\beta R)^{K} \left[P_A(v/e\beta R) \otimes_{K-1} P_A(v/e\beta R)\right] \tag{18} \]

where ⊗_k denotes k-fold convolution. Let us assume that P_A(x) is
Gaussian with mean A and standard deviation aA/2, where 0 < a < 1
represents a percentage of the gain. Then (18) is a Gaussian
density with mean KAeβR and variance K[(aA/2)eβR]^2, so that (17)
becomes

\[ P(v) = \sum_{i=0}^{\infty} (\bar{K}^i/i!)\exp(-\bar{K})\,G\{v, iAe\beta R, i[(aA/2)e\beta R]^2\}. \tag{19} \]

If one adds the sample contribution due to the thermal noise, then
the sample z has a probability density

\[ p_z(z,\bar{K}) = \sum_{i=0}^{\infty} (\bar{K}^i/i!)\exp(-\bar{K})\,G(z, iAe\beta R, i[(aA/2)e\beta R]^2 + N_0\beta). \tag{20} \]

Note that (20) is identical to (13), except for the variance terms
in the Gaussian densities, and that it specializes to (13) for
a = 0. Hence the probability of detection is given exactly by (14),
with the variance σ^2 replaced by this variance. The resulting
error probabilities are shown in Fig. 6 as a function of the
spreading parameter a, using parameters as in Fig. 5. Although the
results vary somewhat as a function of the signal and noise, one
observes that for mean gains between 10^4 and 10^5 the error
probabilities obtained earlier (Fig. 2) are valid with gain spreads
as high as 30 to 40 percent. Even with spreads as high as 70
percent, the results indicate only about a factor of 2 increase in
error probability. The primary conclusion, then, is that with
suitably high mean photomultiplier gains, system error
probabilities can be basically divorced from additive thermal noise
effects. In this case the error curves plotted in Fig. 2 represent
the overall system error probabilities.

The previous results also imply that a device average
gain-to-spread ratio should be as large as possible for best
operation. Consider an idealized photomultiplier characterized as a
Poisson branching process [10]. Every photoelectron emitted from
the photoemissive surface impinges on the first stage of the device
and releases K secondary electrons which are Poisson distributed in
number with parameter δ_1. The secondary electrons are then focused
on the second stage where the same effect occurs. This process
continues on through n stages, resulting in a large electron flow
at the anode for each photoelectron emitted. The distribution of
electrons at the anode output is quite complicated but is unimodal
and quite easily fitted with a Gaussian distribution, as we have
already done. In our calculations of error probability the two
important parameters emerging were the mean gain of the device A
and the mean gain to rms gain ratio Γ. For the idealized
photomultiplier the two parameters can be calculated in a
straightforward manner and are, in our previous notation,

\[ A = \prod_{i=1}^{n} \delta_i \]

\[ \Gamma = \frac{2}{a} = \left(\frac{\delta_1}{1 + 1/\delta_2 + 1/\delta_2\delta_3 + 1/\delta_2\delta_3\delta_4 + \cdots}\right)^{1/2}. \]

Notice that Γ is almost completely characterized by the first-stage
gain δ_1 (a minor contribution is also made by the second stage).
Thus we can relate a directly to the gain of the first stage by

\[ a \approx 2/\delta_1^{1/2}. \]

For photon counting, Fig. 6 indicates that a < 0.4, or that
2/δ_1^{1/2} < 0.4, i.e., δ_1 > 25.

To take into account the effects of the remaining n − 1 stages,
assume δ_2 = δ_3 = ··· = δ_n = δ. Then

\[ \sum_{i=0}^{n} (1/\delta)^{i} = \frac{1 - (1/\delta)^{n+1}}{1 - (1/\delta)} < \frac{1}{1 - (1/\delta)} = \frac{\delta}{\delta - 1}. \]

Therefore, δ_1 should be increased by δ/(δ − 1). Typically δ = 4,
so that

\[ \delta_1 > 25 \cdot \tfrac{4}{3} = 33. \]
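The stage-gain arithmetic above is easy to reproduce. The Python sketch below computes the mean gain and the spread parameter a = 2/Γ from a list of per-stage gains, using the branching-process expression for Γ given in the text. The function names and the ten-stage tube in the usage example are hypothetical, chosen only for illustration.

```python
import math


def mean_gain(stage_gains):
    """A: product of the per-stage mean secondary-emission gains."""
    gain = 1.0
    for delta in stage_gains:
        gain *= delta
    return gain


def gain_spread(stage_gains):
    """a = 2/Gamma, with Gamma^2 = delta_1 / (1 + 1/d2 + 1/(d2 d3) + ...)."""
    series = 1.0
    term = 1.0
    for delta in stage_gains[1:]:
        term /= delta
        series += term
    return 2.0 * math.sqrt(series / stage_gains[0])


if __name__ == "__main__":
    # Hypothetical tube: first stage at 25 * 4/3, nine later stages of gain 4.
    tube = [100.0 / 3.0] + [4.0] * 9
    print(f"mean gain A = {mean_gain(tube):.3e}")
    print(f"spread    a = {gain_spread(tube):.4f}")
```

With a single stage of gain 25 the spread is exactly the limiting value a = 0.4; adding later stages of gain 4 raises the required first-stage gain by the factor 4/3 derived above.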


Information Rate of a PPM System 

We have so far analyzed only one aspect of system performance,
i.e., error probabilities. The actual information rate that the
link achieves is another important design consideration. As stated,
the transmitter sends optical energy in one of M time intervals,
each of which is ΔT seconds wide, thereby transmitting one of M
possible signals in MΔT seconds, or at a rate (log_2 M)/MΔT bit/s.
The receiver correctly determines the true signal with probability
1 − P_E and is in error with probability P_E. Because of symmetry
the erroneous signal may be equally likely interpreted as any of
the M − 1 incorrect signals. Thus the overall channel may be
depicted as an M-ary symmetric channel, in which each of the M
possible transmitter signals is converted to itself with
probability 1 − P_E and converted to each other signal with
probability P_E/(M − 1). The information rate for such a channel is
known to be

\[ H = \frac{\log_2 M + P_E \log_2[P_E/(M-1)] + (1 - P_E)\log_2(1 - P_E)}{M\Delta T}. \tag{21} \]

For convenience we shall denote this as

\[ H = C(K_s, K_n, M)/M\Delta T \tag{22} \]

to emphasize the dependence of the numerator on the stated
parameters. By using (22) and the families of error probability
curves as in Fig. 2, the rate H can be evaluated by
straightforward substitution. Although specific curves
for such a computation are not shown here, it suffices to 
note that if K s and K N are such that Pe < 10 \ then 
(22) is, to a good approximation, 

\[ H \approx (1 - P_E)[(\log_2 M)/M\Delta T] = (\log_2 M)/M\Delta T - P_E[(\log_2 M)/M\Delta T]. \tag{23} \]

If we interpret the rate H as the source rate minus the
equivocation of the channel, then the PPM optical system behaves
approximately as if a source rate of (log_2 M)/MΔT is passed into a
channel of equivocation P_E(log_2 M)/MΔT. As noted in (10), even if
K_n → 0 (no background interference), P_E → exp(−K_s)/2, so that
the equivocation is not due entirely to the background noise.
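The rate expressions (21) through (23) translate directly into code. The Python sketch below evaluates the numerator C of (22) for a given word error probability and compares the exact rate with the approximation (23). The function names are illustrative, and the convention 0 · log 0 = 0 at the end points is an assumption stated here rather than in the paper.

```python
import math


def channel_bits(pe, m):
    """Numerator C of (21)/(22): bits per PPM word, M-ary symmetric channel."""
    bits = math.log2(m)
    if pe > 0.0:                      # 0 * log 0 taken as 0
        bits += pe * math.log2(pe / (m - 1))
    if pe < 1.0:
        bits += (1.0 - pe) * math.log2(1.0 - pe)
    return bits


def information_rate(pe, m, slot_width):
    """H of (21) in bit/s for M slots of slot_width seconds each."""
    return channel_bits(pe, m) / (m * slot_width)


def approximate_rate(pe, m, slot_width):
    """Small-P_E approximation (23)."""
    return (1.0 - pe) * math.log2(m) / (m * slot_width)


if __name__ == "__main__":
    for pe in (1e-1, 1e-2, 1e-3):
        exact = information_rate(pe, 16, 1e-9)
        approx = approximate_rate(pe, 16, 1e-9)
        print(f"P_E = {pe:.0e}   exact = {exact:.4e}   approx = {approx:.4e}")
```

The gap between the two columns shows how quickly (23) becomes adequate as P_E falls.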

The use of (22) and the previous equations is helpful in
determining the rate, given operating parameters. However, the
converse design problem, which is to determine particular parameter
values that achieve a desired rate, is not so straightforward. This
is due to the fact that the rate is a somewhat complicated function
of the parameters. We shall consider here two aspects of this
design problem that have practical application under certain
operating conditions. First, the word period T = MΔT is held fixed
while the information bandwidth β = 1/ΔT is allowed to vary; and
second, the system bandwidth β is held fixed while the word period
is allowed to vary. In both cases we are interested in the
relationship between the rate H and the transmitter parameters K_s
and M, assuming that the noise power is held fixed.

Fixed Word Period

We assume here that ΔT is allowed to vary with M so as to maintain
T = MΔT constant. Thus the system “squeezes” more signals into the
T-second period as M increases. The resulting rate is then

\[ H = C(K_s, K_{NT}/M, M)/T \tag{24} \]

where K_NT is the noise energy in T. Thus the rate depends only
upon the numerator of (22). With K_s fixed, increasing M increases
the source rate, but the error probability also increases and
eventually reaches an asymptotic value of

\[ P_E = \left\{1 + \frac{K_s[K_{NT} - 1 + \exp(-K_{NT})]}{K_{NT}}\right\} \exp(-K_s) \]

for large M. The resulting system rate increases, to within a
constant of the entropy of the alphabet, (log_2 M)/T. Therefore, it
is clear that if the bandwidth is expendable, one will always
increase the system rate for large M by increasing M. In a
practical system this implies that one should operate with as wide
a bandwidth as possible to fully exploit the capability of the PPM
system. We are therefore led naturally to consider the design of a
system for an arbitrary rate H, when the full bandwidth (1/ΔT) of
the system is limited.
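The large-M asymptote of the error probability can be recovered by a short limiting argument. The sketch below is a reconstruction offered for the reader, not taken from the paper. It assumes that as M → ∞ with K_NT fixed, the signal-slot count k becomes Poisson with mean K_s, no nonsignaling slot holds more than one electron, and the number t of nonsignaling slots holding exactly one electron becomes Poisson with mean K_NT.

```latex
\begin{align*}
\Pr\{\text{correct} \mid k = 0\} &\to 0, \\
\Pr\{\text{correct} \mid k = 1\} &\to \mathrm{E}\!\left[\frac{1}{t+1}\right]
   = \sum_{t=0}^{\infty} \frac{K_{NT}^{\,t}\, e^{-K_{NT}}}{(t+1)!}
   = \frac{1 - e^{-K_{NT}}}{K_{NT}}, \\
\Pr\{\text{correct} \mid k \ge 2\} &\to 1, \\
P_E &\to e^{-K_s} + K_s e^{-K_s}\left[1 - \frac{1 - e^{-K_{NT}}}{K_{NT}}\right]
   = \left\{1 + \frac{K_s\,[K_{NT} - 1 + e^{-K_{NT}}]}{K_{NT}}\right\} e^{-K_s}.
\end{align*}
```

The first line reflects that an empty signal slot either loses to a noise electron or wins a draw among M slots with probability 1/M, which vanishes as M grows.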

Fixed Bandwidth 

In this case ΔT is held constant (thereby fixing the noise energy
K_n in ΔT) so that both the numerator and denominator in (22)
depend upon M, and the rate degrades quickly as M increases due to
the (log M)/M dependence. A given rate, e.g., H_0, may be obtained
by many different combinations of K_s and M. These equivalent
operating points may be obtained graphically by noting that they
are the values for which the numerator C(K_s, K_n, M), considered
as a function of M, intersects the straight line H_0 ΔT M. By
plotting these functions for various K_s their intersection will
identify (K_s, M) pairs which achieve the rate H_0. One may then
decide on a particular operating point by invoking suitable design
criteria. For example, one may select the smallest M from among the
candidate pairs, which then minimizes the word period T = MΔT.
Alternatively, one may choose to minimize the average transmitter
power per information bit, which is proportional to K_s/C. In the
latter case, therefore, one would select the operating pair
(K_s, M) for which K_s/C is minimal. An application of this
procedure is given in the next section.


An Example: Real-Time Television 
from Deep Space 

To illustrate the design procedures outlined in this 
paper, we will consider a television system which will 
transmit in real time from deep space. System parameter 
values will be chosen to allow us to use the previously 
derived data and will not always reflect the optimum 
values or current state of the art. We shall assume the 
following transmission parameters: 


    background              blue sky
    distance                4 × 10^8 km
    optical loss            50 percent
    receiver diameter       16 meters (nondiffraction limited)
    receiver temperature    300°
    optical bandwidth       5 Å
    quantum efficiency      20 percent
    resolution              1 arc second
    photomultiplier gain    >10^5
    optical frequency       5000 Å
    signal bandwidth        10^9 Hz.

For real-time television a rate of approximately 7 × 10^7 bit/s is
required (corresponding to better than 2 samples per information
hertz and 7 bits per sample coding). Our objective is to determine
design parameters for a PPM system that uses minimal average
transmitter power and a bit error probability no greater than
10^-4. Using this list of parameters in (4) and (5) yields
K_n = 0.01 as the noise background count. Following the discussion
in the previous section we plot C(K_s, 0.01, M) as a function of M,
and determine the intersection with the line
(H_0 ΔT)M = (7 × 10^-2)M, yielding the tabulation in Table I. The
minimal average transmitter power occurs when K_s = 1.4 and M = 42,
which defines the PPM system. Using (2) and (3), the transmitter
then requires 0.072 watts. The corresponding P_E can be obtained
from (7) to get the word error probability (2.4 × 10^-1), for which
the corresponding bit error probability is approximately
P_E/2 ≈ 10^-1 [9]. Thus the bit error is larger than that desired
and in fact will be further increased by the thermal noise, as
evidenced by the data of Fig. 5. (This figure shows a slight
increase in P_E for K_s = 10 and M = 2 at photomultiplier gains of
10^4, and one can conclude the situation will be worse for
K_s = 1.4 and M = 42.) Thus the minimal power condition is not
sufficient to obtain the desired P_E for this example without
coding.

TABLE I

    K_s     M      K_s/C
    0.8     19     0.593
    0.9     23     0.552
    1.0     28     0.506
    1.2     35     0.494
    1.4     42     0.484
    1.6     47     0.488
    1.8     52     0.496
    2.0     55     0.508
    5.0     95     0.792

To achieve the desired bit error probability, we note from Fig. 2
that a system with K_s = 10 and M = 100 yields bit error
probabilities P_E/2 ≈ 10^-4 at the same noise level. We would
expect no appreciable degradation from thermal noise, and Fig. 6
indicates that only slight increases occur even with
photomultipliers having spreading as high as 45 percent. The
transmitter power, which no longer is minimal for the desired
information rate, is found to be 0.25 watts, an increase of 5.4 dB
over the minimal average conditions.
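The tabulated operating points can be cross-checked by chaining (7), (21), and (22). The Python sketch below recomputes K_s/C and the rate at each (K_s, M) pair of Table I, and also evaluates the alternative point K_s = 10, M = 100. It is an illustrative sketch: the function names and the truncation of the sum in (7) are assumptions made here, the table rows are copied as printed, and the slot width of 10^-9 seconds is inferred from the stated 10^9 Hz signal bandwidth.

```python
import math

KN = 0.01            # noise count per slot quoted in the example
SLOT_WIDTH = 1e-9    # seconds, inferred from the 10^9 Hz signal bandwidth
# (K_s, M, K_s/C) rows as printed in Table I
TABLE_I = [(0.8, 19, 0.593), (0.9, 23, 0.552), (1.0, 28, 0.506),
           (1.2, 35, 0.494), (1.4, 42, 0.484), (1.6, 47, 0.488),
           (1.8, 52, 0.496), (2.0, 55, 0.508), (5.0, 95, 0.792)]


def word_error_probability(ks, kn, m):
    """1 - P_D with P_D from (7); the sum is truncated deep in the tail."""
    mean_sig = ks + kn
    kmax = int(mean_sig + 12.0 * math.sqrt(mean_sig) + 30)
    sig_pmf, noise_pmf, noise_cdf = math.exp(-mean_sig), math.exp(-kn), 0.0
    pd = math.exp(-(ks + m * kn)) / m
    for k in range(1, kmax + 1):
        noise_cdf += noise_pmf          # P(noise-slot count <= k - 1)
        sig_pmf *= mean_sig / k         # P(signal-slot count = k)
        noise_pmf *= kn / k             # P(noise-slot count = k)
        ties = 1.0
        if noise_pmf > 0.0:
            alpha = noise_pmf / noise_cdf
            ties = math.expm1(m * math.log1p(alpha)) / (m * alpha)
        pd += sig_pmf * noise_cdf ** (m - 1) * ties
    return 1.0 - pd


def channel_bits(pe, m):
    """Numerator C of (21)/(22), with 0 * log 0 taken as 0."""
    bits = math.log2(m)
    if pe > 0.0:
        bits += pe * math.log2(pe / (m - 1))
    if pe < 1.0:
        bits += (1.0 - pe) * math.log2(1.0 - pe)
    return bits


def operating_point(ks, m, kn=KN, slot_width=SLOT_WIDTH):
    """Word error probability, bits per word, K_s/C, and rate for one design."""
    pe = word_error_probability(ks, kn, m)
    bits = channel_bits(pe, m)
    return {"pe": pe, "bits": bits, "ks_per_bit": ks / bits,
            "rate": bits / (m * slot_width)}


if __name__ == "__main__":
    for ks, m, printed in TABLE_I:
        point = operating_point(ks, m)
        print(f"K_s = {ks:3.1f}  M = {m:3d}  K_s/C = {point['ks_per_bit']:.3f}"
              f"  (table: {printed:.3f})  rate = {point['rate']:.2e} bit/s")
    alt = operating_point(10.0, 100)
    print(f"K_s = 10, M = 100: P_E/2 = {alt['pe'] / 2.0:.2e}")
```

Selecting the row with the smallest K_s/C implements the minimum-power criterion described for the fixed-bandwidth case.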

Conclusion 

In this paper we have considered some design aspects 
of an optical M-ary PPM communication system using 
photon counters (photodetectors followed by current inte- 
grators) at the receiver. The system considered transmits 
monochromatic optical energy in one of M time intervals, 
and the receiver determines the photon count (i.e., the 
received energy) in each interval and performs a maxi- 
mum-likelihood test to determine which signal is being 


received. Complete time synchronization is assumed to be 
maintained at all times. Performance characteristics are 
derived in terms of system parameters with both back- 
ground radiation and thermal noise interference. In par- 
ticular, it is shown that, unlike the case of pure Gaussian 
additive noise, system performance does not depend on a 
few key parameters but must be recomputed for different 
operating points. The important equations for deriving 
these characteristics are introduced. From these equations, 
design procedures are outlined which lead to best choices 
of transmitter power, numbers of signals, interval lengths, 
etc., in order to obtain desired error probabilities and 
information rates. In particular, we have indicated the 
requirements imposed upon the individual components 
and have delineated the strong and weak points in the 
overall system. No attempt has been made to estimate 
the cost or weight in building such a system. This in fact 
would be premature since the technology required is in 
its infancy and is undergoing swift and quite radical 
change. In addition, we have not included any discussion 
of the methods to maintain link synchronization nor any 
consideration of atmospheric effects other than a loss 
factor. 

Acknowledgment 

The authors would like to thank M. G. Hurwitz and 
H. Asai for providing computational assistance. 

References 

[1] B. Reiffen and H. Sherman, “An optimum demodulator for
Poisson processes: photon source detectors,” Proc. IEEE,
vol. 51, pp. 1316-1320, October 1963.

[2] K. Abend, “Optimum photon detection,” IEEE Trans. In- 
formation Theory (Correspondence), vol. 12, pp. 64-65, January 
1966. 

[3] R. M. Gagliardi and S. Karp, “M-ary Poisson detection and
optical communications,” IEEE Trans. Communication Technology,
vol. COM-17, pp. 208-216, April 1969.

[4] C. W. Helstrom, “Quantum limitations on the detection of 
coherent and incoherent signals,” IEEE Trans. Information 
Theory, vol. IT-11, pp. 482-490, October 1965. 

[5] W. K. Pratt, “Binary detection in an optical polarization 
modulation communication channel,” IEEE Trans. Communi- 
cation Technology (Concise Papers), vol. COM-14, pp. 664-665, 
October 1966. 

[6] S. Karp, “A statistical model for radiation with applications
to optical communications,” Ph.D. dissertation, Dept. of
Elec. Engrg., University of Southern California, Los Angeles,
Calif., 1967.

[7] B. M. Oliver, “Thermal and quantum noise,” Proc. IEEE, 
vol. 53, pp. 436-454, May 1965. 

[8] L. Mandel, “Fluctuations of light beams,” in Progress in 
Optics, E. Wolf, Ed. New York: Wiley, 1963. 

[9] A. J. Viterbi, Principles of Coherent Communication. New 
York: McGraw-Hill, 1966. 

[10] H. J. Gale and J. A. B. Gibson, “Methods of calculating the 
pulse height distribution at the output of a scintillation counter,” 
J. Sci. Inst., vol. 43, no. 4, pp. 224-228, 1966. 

[11] S. Karp, M. G. Hurwitz, and R. M. Gagliardi, “Error
probabilities for maximum likelihood detection of M-ary Poisson
processes in Poisson noise,” NASA, Tech. Note TN-D-4721,
October 1968.


Sherman Karp (M’62), for a photograph and biography please 
see page 216 of the April, 1969, issue of this Transactions. 


Robert M. Gagliardi (S’57-M’61), for a photograph and biography 
please see page 216 of the April, 1969, issue of this Transactions. 


Reprinted by permission from IEEE TRANSACTIONS ON COMMUNICATION TECHNOLOGY 
Vol. COM-17, No. 6, December 1969 
pp. 670-676 

Copyright 1970, by the Institute of Electrical and Electronics Engineers, Inc. 

Printed in U.S.A. 



IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. IT-16, NO. 2, MARCH 1970 

On the Representation of a Continuous Stochastic 
Intensity by Poisson Shot Noise 

SHERMAN KARP, MEMBER, IEEE, and ROBERT M. GAGLIARDI, MEMBER, IEEE 


Abstract — In many applications a Poisson shot noise (PSN) 
process is said to statistically “represent” its intensity process. 
In this paper an investigation is made of the relationship between 
a PSN process and its intensity, when the latter is a sample function 
of a continuous stochastic process. The difference of the moments 
and the mean-square difference between the two processes are 
examined. The continuity assumption on the intensity permits 
the development of a sequence of moment relationships in which 
the effect of the PSN parameters can be seen. The results simplify 
and afford some degree of physical interpretation when the com- 
ponent functions of the PSN are “rectangular,” or when the in- 
tensity process does not vary appreciably over their time width. An 
integral equation is derived that defines the component function 
that minimizes the mean-square difference between the two pro- 
cesses. It is shown that a “degenerate” form of component function 
induces complete statistical equality of the two processes. The 
problem has application to optical communication systems using 
photodetectors. 

I. Introduction 

IN THIS PAPER a study is made of the relationship 
between a Poisson shot noise process and its inherent 
intensity when the latter is a sample function of a 
continuous stochastic process. The problem is of interest 
in certain applications where a shot noise process is used 
to “represent” the intensity process. When the intensity 
is a deterministic time function, the relations between the 
shot noise and its intensity are well known [1]-[3]. However, 
when the intensity is itself a stochastic process, the 
manner in which the statistics of the intensity and shot 
noise are related is somewhat vague. 

The problem has primary application to optical com- 
munications where shot noise processes are generally 
accepted as models for the output of wide-band photo- 
detectors. In such models, the intensity of the received 
radiation impinging on the detector surface becomes the 
intensity parameter of the shot noise. When the received 
intensity corresponds to a desired modulating process 
(e.g., if the optical transmitter is intensity modulated with 
the process) a question then arises as to the context in 
which the detector has “demodulated” the input radiation. 

Manuscript received April 1, 1969; revised August 21, 1969. 
This work was sponsored by the National Aeronautics and Space 
Administration, under NASA Contract NGR-05-018-004. This 
grant is part of the research program at NASA’s Electronics Research 
Center, Cambridge, Mass. 

S. Karp was formerly with the Department of Electrical Engineer- 
ing, University of Southern California. He is presently with NASA 
Research Center, Cambridge, Mass. 02139. 

R. M. Gagliardi is with the Department of Electrical Engineering, 
University of Southern California, Los Angeles, Calif. 90007. 


Other applications of similar shot noise modeling involve 
radiation scattering and perturbation effects [4], [5]. 

II. Moment Differences 

Let $P(t_1, t_2)$, $t_2 > t_1$, indicate the number of random 
events occurring in the time interval $(t_1, t_2)$. The number 
of events is Poisson distributed over $(t_1, t_2)$ if the probability 
of $k$ events in $(t_1, t_2)$ is given by 

$$\Pr[P(t_1, t_2) = k] = \left[\int_{t_1}^{t_2} n(t)\,dt\right]^k \exp\left[-\int_{t_1}^{t_2} n(t)\,dt\right] \Big/ k! \tag{1}$$

where n(t) is an integrable nonnegative function and is 
called the intensity of the events. A Poisson shot noise 
(PSN) process is then defined as 

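$$I(t) = \sum_{m=1}^{P(-\infty,\,t)} h(t - t_m) \tag{2}$$

where $h(t)$ is the component function of the process and 
$\{t_m\}$ is a sequence of independent random variables, each 
having probability density 

$$p_{t_m}(x) = \frac{n(x)}{\int_{-\infty}^{t} n(u)\,du}, \qquad x \le t. \tag{3}$$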



It is well known [1], [2] that the semi-invariants of the 
first-order density of the process $I(t)$, at any $t$, are given by 

$$\lambda_j(t) = \int_{-\infty}^{\infty} h^j(t - x)\,n(x)\,dx. \tag{4}$$

When the intensity $n(t)$ is a sample function of a stochastic 
process, the semi-invariants are themselves random processes, 
and the statistics of the process $I(t)$ have a complicated 
relation to those of the process $n(t)$. By making use 
of some practical assumptions concerning $n(t)$ and $h(t)$, 
we can, nevertheless, derive some properties of this 
relationship. 

Let $n(t)$ be a sample function from a continuous nonnegative 
real bounded stationary random process $N$. (By 
continuous we mean every sample function is everywhere 
continuous, almost surely. By bounded we mean at every 
$t$, $n(t)$ is a bounded random variable.) Let the component 
function $h(t)$ be nonnegative real and time limited to $\tau$ seconds 
[i.e., $h(t) = 0$ for all $t$ outside $(0, \tau)$], and let $h^k(t)$ be integrable 
over $(0, \tau)$ for all $k$. Then the continuity of the process $N$ 
allows us to apply the mean-value theorem for integrals* 
to rewrite the conditional (on $N$) semi-invariants as 

$$\lambda_j = n(t_j) \int_{t-\tau}^{t} h^j(t - x)\,dx, \qquad t - \tau < t_j < t \tag{5}$$

where we have dropped the $t$ variable from the $\lambda_j$ description 
for convenience. The first-order moments of the PSN 
process $I(t)$ can now be directly related to the process $N$. 
We write the general $k$th moment as 

$$E[I^k(t)] = E\{E[I^k(t) \mid N]\} \tag{6}$$

where $E$ is the expectation operator and $E(\cdot \mid N)$ is expectation 
conditioned upon $N$. Using the relation between semi-invariants 
and moments [7], the conditioned expectations 
in (6) can be obtained from (5). Then substituting into (6), 
and using the stationarity of the process $N$, yields the 
sequence of moment relations: 

$$\begin{aligned}
E[I(t)] &= E(N)\,\epsilon_1 \\
E[I^2(t)] &= E(N^2)\,\epsilon_1^2 + E(N)\,\epsilon_2 \\
E[I^3(t)] &= E(N^3)\,\epsilon_1^3 + 3E[n(t_1)n(t_2)]\,\epsilon_1\epsilon_2 + E(N)\,\epsilon_3
\end{aligned} \tag{7}$$

where $E(N^k)$ is the $k$th moment of the process $N$, $t_1, t_2, \cdots$ 
are points in $(t - \tau, t)$, and 

$$\epsilon_j = \int_0^{\tau} h^j(x)\,dx. \tag{8}$$

The lack of a general term relating moments and semi-invariants 
prevents us from writing an expression for the 
general $k$th moment. However, it is evident that the $k$th 
moment can be cast in the form 

$$E[I^k(t)] = E(N^k)\,\epsilon_1^k + D(k). \tag{9}$$

In general, $D(k)$ represents a summation of terms in which 
each term is of the form 

$$C(k, m, q)\,E[n^i(t_m)\,n^j(t_q)]\,\epsilon_m^i\,\epsilon_q^j \tag{10}$$

where $i + j \le k - 1$. The $C(k, m, q)$ are positive constants 
generated from the expansion in (7). Equation (9) represents 
a general expression relating the moments of the 
PSN process to the statistics of $N$, in compliance with 
the aforementioned assumptions. It is clear that this 
relation depends not only upon the moments of $N$, but also 
upon its second-order statistics. Note that the 
component functions $h(t)$ enter the equation through both 
the $\{\epsilon_j\}$ terms and the $\{t_j\}$ parameters. 
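
The passage from semi-invariants to moments used in (7) can be checked numerically. The following is a minimal sketch, assuming a constant (deterministic) intensity so that the conditional semi-invariants reduce to $n\epsilon_j$; the function names are illustrative and do not come from the paper.

```python
def raw_moments_from_cumulants(k1, k2, k3):
    """First three raw moments from the first three semi-invariants (cumulants)."""
    m1 = k1
    m2 = k1 ** 2 + k2
    m3 = k1 ** 3 + 3.0 * k1 * k2 + k3
    return m1, m2, m3


def rectangular_eps(height, width, j):
    """epsilon_j of (8) for a rectangular component function of given height and width."""
    return height ** j * width


def psn_moments_constant_intensity(n, height, width):
    """Moments of I(t) when n(t) = n is constant and h(t) is rectangular (assumption)."""
    eps = [rectangular_eps(height, width, j) for j in (1, 2, 3)]
    # With constant intensity, lambda_j = n * epsilon_j, as in (5).
    return raw_moments_from_cumulants(n * eps[0], n * eps[1], n * eps[2])


if __name__ == "__main__":
    # Unit-height, unit-width pulses reduce I(t) to a Poisson count with mean n.
    print(psn_moments_constant_intensity(2.0, 1.0, 1.0))
```

With unit-height, unit-width pulses the process is a plain Poisson count, so the three moments should agree with the known Poisson values $\mu$, $\mu^2 + \mu$, and $\mu^3 + 3\mu^2 + \mu$.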

* The theorem referred to here is sometimes called the second 
mean-value theorem for integrals [6]. It states 

$$\int_a^b f(x)g(x)\,dx = f(\xi)\int_a^b g(x)\,dx, \qquad a < \xi < b,$$

and requires only the continuity of $f$, the positiveness of $g$, and the 
integrability of $g$ and $fg$. 

An important result concerning the process signal-to-noise 
ratio can be stated from (9). If we define the process 
SNR as the ratio of the square of its mean to its variance, 
we have 

$$(\mathrm{SNR})_I \triangleq \frac{E^2[I(t)]}{\operatorname{var}[I(t)]} = \frac{[E(N)\epsilon_1]^2}{E(N^2)\epsilon_1^2 + E(N)\epsilon_2 - [E(N)\epsilon_1]^2} = \frac{E^2(N)/\operatorname{var}(N)}{1 + [E(N)\epsilon_2/\epsilon_1^2 \operatorname{var}(N)]} < (\mathrm{SNR})_N \tag{11}$$


where $(\mathrm{SNR})_N$ is the signal-to-noise ratio of the intensity 
process $N$. Hence, the SNR of the PSN process is always 
less than that of its intensity process. We make this point 
mainly because the above definition of SNR is commonly 
used in assessing signal quality in communication system 
analysis. 
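
The inequality in (11) is easy to evaluate for given numbers. The sketch below is illustrative only: the function names and the sample values are assumptions, and the formulas are those of (11) with $\epsilon_1$ and $\epsilon_2$ supplied directly.

```python
def snr_intensity(mean_n, var_n):
    """SNR of the intensity process N: squared mean over variance."""
    return mean_n ** 2 / var_n


def snr_psn(mean_n, var_n, eps1, eps2):
    """SNR of the PSN process I(t), following (11)."""
    # var[I] = E(N^2) eps1^2 + E(N) eps2 - [E(N) eps1]^2 = var(N) eps1^2 + E(N) eps2
    var_i = var_n * eps1 ** 2 + mean_n * eps2
    return (mean_n * eps1) ** 2 / var_i


if __name__ == "__main__":
    mean_n, var_n = 10.0, 4.0          # hypothetical intensity statistics
    print(snr_intensity(mean_n, var_n))
    print(snr_psn(mean_n, var_n, eps1=1.0, eps2=1.0))
```

The extra term $E(N)\epsilon_2$ in the variance is the shot-noise contribution; it vanishes only in the limit $\epsilon_2/\epsilon_1^2 \to 0$.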

For convenience, we can normalize (9). Define the 
intensity of the PSN process to be normalized by the 
factor $\epsilon_1$, and denote the resulting PSN process by $I_0(t)$. 
That is, we consider the normalized PSN process $I_0(t)$ 
whose intensity process has sample functions 

$$n_0(t) = \frac{n(t)}{\epsilon_1}. \tag{12}$$

For this case, (9) becomes 

$$E[I_0^k(t)] = E(N^k) + D_0(k) \tag{13}$$

where $D_0(1) = 0$, $D_0(2) = E(N)\epsilon_2/\epsilon_1$, 
$D_0(3) = \{3E[n(t_1)n(t_2)]\epsilon_2 + E(N)\epsilon_3\}/\epsilon_1$, etc. That is, $D_0(k)$ is a normalized 
form of $D(k)$ and, in general, contains terms identical to 
those of $D(k)$ in (10), divided by the factor $\epsilon_1^{i+j}$. When 
written as in (13), $D_0(k)$ represents the difference between 
the moments of process $N$ and the moments of the normalized 
PSN $I_0(t)$, the latter having intensity process $N_0$ 
in (12). If the PSN is to “represent” the process $N$ in 
the $k$th moment, then $D_0(k)$ should be “small” compared 
to that moment. Consider, for example, the relation of the 
mean-square values of $I_0(t)$ and $n(t)$. In this case, $D_0(2) = 
E(N)\epsilon_2/\epsilon_1$ and minimization of $D_0(2)$ requires minimization 
of $\epsilon_2$, which depends only upon $h(t)$. We may then 
inquire if there is one component function $h(t)$, $0 \le t \le \tau$, 
that will minimize $\epsilon_2$ for a fixed $\epsilon_1$. By straightforward 
application of calculus of variations, using Lagrange 
multipliers, we obtain the solution 

$$h(t) = \begin{cases} \epsilon_1/\tau, & 0 \le t \le \tau \\ 0, & \text{elsewhere.} \end{cases} \tag{14}$$

That is, a “rectangular” function spread over the interval 
$(0, \tau)$. The solution in fact minimizes $\epsilon_n$ for all $n \ge 2$, but 
we must not hastily conclude that the rectangular component 
function minimizes moment differences $D_0(k)$ for 
all $k$. The rectangular function is, however, of interest not 
only for the above reason, but also because it simplifies 
(13), allowing further insight. Define $d = \epsilon_1/\tau$. Then 
$\epsilon_j = d^{j-1}\epsilon_1$, $\lambda_j = n(\theta)d^j\tau$, and $t_1 = t_2 = t_3 = \cdots = \theta$, where 
$t - \tau < \theta < t$. This allows us to substitute into (10) the 
identity $E[n^i(\theta)n^j(\theta)] = E(N^{i+j})$, for all $i$, $j$, which in turn 
allows us to rewrite (13) as 

$$E[I_0^k(t)] = E[N^k] + \sum_{i=1}^{k-1} C(k, i)\,E(N^{k-i})\,d^i = E[N^k]\left[1 + \frac{\sum_{i=1}^{k-1} C(k, i)\,d^i\,E(N^{k-i})}{E(N^k)}\right] = E[N^k]\,[1 + D_0'(k)] \tag{15}$$


where $D_0'(k)$ denotes the right-hand bracketed term and 
the $C(k, i)$ are again positive constants [actually combinations 
of the previous $C(k, m, q)$ terms]. Equation (15) 
allows us to conclude that the $k$th moment of the normalized 
PSN process, with rectangular component functions, 
will be approximately equal to the $k$th moment of the 
process $N$ only if $D_0'(k) \ll 1$. The term $D_0'(k)$ can be 
evaluated knowing just the first $k$ moments of $N$, except 
that the constants $C(k, i)$ are not known in a general closed 
form.¹ Since the intensity process $N$ is nonnegative, we 
can make use of well-known properties of absolute 
moments [8] to establish 

$$E[N^k] \ge [E(N^{k-i})]^{k/(k-i)} = E[N^{k-i}]\,[E(N^{k-i})]^{i/(k-i)} \ge E(N^{k-i})\,[E(N)]^i. \tag{16}$$

Substituting into (15) then implies 

$$D_0'(k) \le \sum_{i=1}^{k-1} \frac{C(k, i)\,d^i}{[E(N)]^i} \tag{17}$$

and an upper bound on $D_0'(k)$ is seen to be inversely 
related to the average value of $N$. For 

$$\frac{E(N)}{d} \gg [C(k, i)]^{1/i}, \qquad \text{all } i < k,$$

(17) is approximately 

$$D_0'(k) \lesssim \frac{k(k-1)\,d}{2E(N)} \tag{18}$$

where the bound is taken as simply the first term of the 
sum. This result implies that the $k$th moment of $I_0(t)$ is 
approximately equal to the $k$th moment of $N$, for all $k$ for 
which 

$$\frac{E(N)}{d} \gg \frac{k(k-1)}{2}. \tag{19}$$


¹ It can be seen, however, that $C(k, 1) = k(k - 1)/2$, by noting 
the relation between semi-invariants and moments. We use this 
fact in (18). 

Note that $E(N)/d = \tau E(N)/\epsilon_1 = \tau E(N_0)$ = (average 
number of occurrences in $\tau$ seconds). Thus, (19) essentially 
states that the “denseness” of the shot events (i.e., the 
average number of component functions occurring in the 
time interval of one component function) must be sufficiently 
large for moment representation. The right side 
of (19) serves as a rough rule of thumb for determining 
how large this denseness must be for approximate equality 
of the $k$th moment. It may be recalled [3] that for PSN 
with deterministic intensities, a condition of large number 
of occurrences is required before the PSN process loses its 
“discrete” nature. Equation (19) can therefore be interpreted 
as the statistical equivalent of this statement, i.e., 
the condition under which the PSN begins to take on the 
statistics of its intensity. 
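
The denseness rule can be illustrated in the simplest setting. The sketch below assumes a constant intensity $n$ and rectangular pulses of height $d$, in which case the normalized process is $d$ times a Poisson count of mean $n/d$ and its moments follow from Stirling numbers of the second kind. This special case and the helper names are assumptions made for illustration; the paper treats a stochastic intensity.

```python
def stirling2(k, i):
    """Stirling number of the second kind S(k, i), by the standard recurrence."""
    table = [[0] * (i + 1) for _ in range(k + 1)]
    table[0][0] = 1
    for a in range(1, k + 1):
        for b in range(1, min(a, i) + 1):
            table[a][b] = b * table[a - 1][b] + table[a - 1][b - 1]
    return table[k][i]


def relative_moment_excess(k, n, d):
    """E[I0^k]/n^k - 1 for I0 = d * Poisson(n/d), i.e. D0'(k) for constant intensity."""
    ratio = d / n
    return sum(stirling2(k, k - i) * ratio ** i for i in range(1, k))


def first_order_term(k, n, d):
    """Leading term k(k-1)d / (2n), the right side of (18) with E(N) = n."""
    return k * (k - 1) * d / (2.0 * n)


if __name__ == "__main__":
    n, d = 10.0, 0.5   # hypothetical values: n/d = 20 occurrences per pulse width
    for k in (2, 3, 4):
        print(k, relative_moment_excess(k, n, d), first_order_term(k, n, d))
```

In this special case the coefficient of the first-order term is $S(k, k-1) = k(k-1)/2$, which matches the value of $C(k, 1)$ quoted in the footnote.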

Note that (15), though strictly valid only for $h(t)$ 
rectangular, is also approximately valid if $\tau$ is much less 
than the time scale of the variations in $n(t)$, no matter what the 
component function is, since we would then be able to 
approximate with $n(t_j) \approx n(t)$ for all $t - \tau < t_j < t$. 
Thus, (16)-(19) are basically applicable whenever $n(t)$ does not change 
appreciably over the time width of the component functions. 
In fact, under the latter assumption, we may even 
remove the condition of stationarity on $N$, and (15) can 
be used with $E(N^k)$ replaced by $E[n^k(t)]$. From this, we 
can conclude that the above equations are valid for nonstationary 
continuous intensities and arbitrary component 
functions, so long as the component functions are sufficiently 
narrow in time width. 

III. Degenerate PSN Processes 

The previous results for rectangular component functions 
allow an additional interpretation. Let us examine 
a “degenerate” situation in which we let $d$, the rectangular 
function height, become arbitrarily small. We then note 
from (17) that as $d \to 0$, $D_0'(k) \to 0$ and 

$$E[I_0^k(t)] \to E(N^k), \qquad \text{all } k.$$

Thus, all the moments of the degenerate process $I_0(t)$ 
become identical to the moments of the process $N$. Furthermore, 
since the process $N$ is bounded, the moment 
principle of random variables [8] guarantees that, in fact, 
as $d \to 0$, the first-order probability density of $I_0(t)$ converges 
to that of the process $N$ at every $t$. The limiting 
condition $d \to 0$ implies that the component rectangular 
functions degenerate to zero in amplitude, while the 
intensity of the $I_0(t)$ process, given by $n(t)/d\tau$, becomes 
infinite. That is, the component functions get “smaller,” 
but their rate of occurrence, in forming the sum in (2), 
increases without bound. Loosely speaking, the functions 
become more “densely packed,” and the PSN behaves 
more like a continuous process than a discrete 
process. 

The actual manner in which the processes behave can 
be seen by investigating the conditional probability densities 
of $I_0(t)$ for $d \approx 0$. Since the semi-invariants $\lambda_n$ are of 
order $O(d^{n-1})$ for $n > 1$, we can establish that the conditional 
first-order probability density of $I_0(t)$ approaches a 
Gaussian density with mean $n(t_1)$ and variance $d\,n(t_2)$. For 
the case where the component functions are extremely 
narrow in time compared to the time variations of the 
process $n(t)$, we can approximate $n(t_1) \approx n(t_2) \approx n(t)$. 
Hence, given the sample function $n(t)$, the degenerate 
process $I_0(t)$ appears as a stochastic process with time-varying 
mean $n(t)$, and an additive Gaussian noise of 
variance $d\,n(t)$. The signal $n(t)$ and the noise are, of course, 
not independent, which distinguishes the true degenerate 
shot noise representation from the more common 
signal-in-additive-noise representation. 

We can, in fact, expand this notion, and show that the 
condition $d \to 0$ allows an even stronger conclusion concerning 
the degenerate $I_0(t)$. The conditional correlation of the two 
random variables $I_0(t_i)$ and $I_0(t_j)$, $t_i \ne t_j$, is given by 

$$E[I_0(t_i)I_0(t_j) \mid N] = E\Big[\sum_m h(t_i - t_m) \sum_k h(t_j - t_k) \,\Big|\, N\Big]. \tag{20}$$

Expanding out the double summation, performing the 
expectation [see (35)], and applying the degenerate 
condition $d \to 0$, yields 

$$E[I_0(t_i)I_0(t_j) \mid N] \xrightarrow[d \to 0]{} E[I_0(t_i) \mid N]\,E[I_0(t_j) \mid N]. \tag{21}$$

That is, the conditional correlation approaches the product 
of the conditional means as $d \to 0$. Thus, $I_0(t_i)$ and $I_0(t_j)$ 
are uncorrelated, and with the Gaussian condition proven 
earlier, are in fact independent. This argument can then be 
extended to prove the conditional mutual independence 
of a sequence of variables $\{I_0(t_i)\}$. The eventual conclusion 
is that 

(22) 

where $\{r_i\}$ is an $r$th-order positive integer set. Thus, the 
$r$th-order moment of the degenerate process $I_0(t)$ approaches 
the corresponding $r$th-order moment of the 
process $N$, and the boundedness of $N$ is again sufficient to 
guarantee convergence of the general $r$th-order probability 
density of $I_0(t)$ to that of $N$. 

IV. Least-Mean-Square PSN Processes 

The more meaningful results in the previous section 
were obtained for the case of rectangular or extremely 
narrow component functions, with degenerate amplitudes. 
In this section, we investigate the validity of these 
assumptions by approaching the problem from a different 
point of view. We attempt to determine if there exists an 
optimal component function $h(t)$, $0 \le t \le \tau$, that minimizes 
the mean-square difference between the shot noise 
process of (2) and an arbitrary nonnegative random process, 
$V$. We shall ultimately be interested in the particular 
case where the process $V$ corresponds to the intensity of the 
shot noise, but initially we can allow some generality. 

We formulate the problem as follows. We are given the PSN 
process of (2) with intensity $n(t)$ as a sample function from 
a random process $N$. Let $v(t)$ be a sample function from 
random process $V$. Let $N$ and $V$ both be stationary nonnegative 
real processes with continuous cross correlation 
$R_{NV}(t)$, and autocorrelations $R_{NN}(t)$ and $R_{VV}(t)$, respectively. 
We define the mean-square difference 

$$J = E[v(t) - I(t)]^2 \tag{23}$$

where $I(t)$ is again the PSN process of (2). We seek the 
nonnegative component function $h_0(t)$, $0 \le t \le \tau$, that 
minimizes $J$. Proceeding formally, we expand out (23) and 
compute the resulting expectations (the details are shown 
in the Appendix), yielding 

$$\begin{aligned} J = {} & E(V^2) - 2\int_{t-\tau}^{t} h(t - t_m)\,R_{NV}(t - t_m)\,dt_m + E(N)\int_{t-\tau}^{t} h^2(t - t_m)\,dt_m \\ & + \int_{t-\tau}^{t}\int_{t-\tau}^{t} h(t - t_m)\,h(t - t_k)\,R_{NN}(t_m - t_k)\,dt_m\,dt_k. \end{aligned} \tag{24}$$

Now by straightforward application of the calculus of 
variations, subject to the constraint $h(t) = 0$ outside $(0, \tau)$, 
we derive the integral equation for the optimal component 
function $h_0(t)$, $0 \le t \le \tau$, that minimizes $J$. This is 

$$R_{NV}(u) = E(N)\,h_0(u) + \int_{-\infty}^{\infty} h_0(u - s)\,R_{NN}(s)\,ds, \qquad 0 \le u \le \tau \tag{25}$$

$$\phantom{R_{NV}(u)} = \int_{-\infty}^{\infty} h_0(u - s)\,[R_{NN}(s) + E(N)\,\delta(s)]\,ds, \qquad 0 \le u \le \tau \tag{26}$$

where $\delta(s)$ is the delta function. The equation has the 
form of a Wiener-Hopf equation. (Indeed, the problem 
could have been formulated in the context of mean-square 
filtering, since $h_0(t)$ can be regarded as a filter-impulse 
function operating upon a PSN process whose component 
functions are delta functions.) The problems in obtaining 
a general solution to (26) are developed in treatments on 
Wiener filter theory and need not be repeated here. The 
resulting mean-square difference, when the solution to (26) 
is substituted into (23), is 

$$J_{\min} = \frac{1}{2\pi}\int_{-\infty}^{\infty} \left[\frac{S_{VV}(\omega)\,[S_{NN}(\omega) + E(N)] - |S_{NV}(\omega)|^2}{S_{NN}(\omega) + E(N)}\right] d\omega \tag{27}$$

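where $S_{VV}(\omega)$ is the Fourier transform of $R_{VV}(t)$, etc. For 
our interest, we are concerned with the above development 
for the special case where $n(t) = v(t)/\epsilon_1$, as in (12). 
Under this assumption, $R_{NV}(t) = R_{VV}(t)/\epsilon_1$ and $R_{NN}(t) = 
R_{VV}(t)/\epsilon_1^2$. The corresponding equation for the optimal 
solution is 

$$R_{VV}(u) = \int_{-\infty}^{\infty} h_0(u - s)\left[\frac{R_{VV}(s)}{\epsilon_1} + E(V)\,\delta(s)\right] ds = \frac{1}{\epsilon_1}\int_{-\infty}^{\infty} h_0(u - s)\,R_{VV}(s)\,ds + E(V)\,h_0(u), \qquad 0 \le u \le \tau. \tag{28}$$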

The solution for $h_0(t)$ is not in general a rectangular function. 
This fact is not surprising, based upon our previous 
results, since we have removed the constraints in the 
derivation of (14). However, if we invoke the assumption 
that $\tau$ is much smaller than the time scale of the variations in $v(t)$ 
[which by the continuity of the correlation functions 
allows us to consider $R_{VV}(s) \approx R_{VV}(u)$ for all $s$ in $(u - \tau, u)$], 
then (28) has only the trivial solution $h_0(u) = 0$, $0 \le u \le \tau$. 
That this is indeed a minimizing solution can be verified 
by substituting into (27) and noting that 

$$J_{\min} = \frac{1}{2\pi}\int_{-\infty}^{\infty} \frac{\epsilon_1 E(V)\,S_{VV}(\omega)}{S_{VV}(\omega) + \epsilon_1 E(V)}\,d\omega \to 0 \quad \text{as } \epsilon_1 \to 0. \tag{29}$$

The above solution for $h_0(u)$ corresponds to a form of 
degenerate PSN process introduced previously. Thus, the 
trivial solution to the desired mean-square-difference equation 
actually has physical meaning in the application 
presented here, as was discussed in Section III. 
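
The limiting behavior of the minimum mean-square difference can be seen numerically. The sketch below evaluates the integral $(1/2\pi)\int \epsilon_1 E(V) S_{VV}(\omega) / [S_{VV}(\omega) + \epsilon_1 E(V)]\,d\omega$ by the trapezoidal rule. The Lorentzian spectrum, the truncation of the frequency axis, and all names are assumptions chosen only for illustration.

```python
import math


def lorentzian(omega, power=1.0, bandwidth=1.0):
    """Hypothetical spectrum S_VV(omega) = 2 P b / (b^2 + omega^2)."""
    return 2.0 * power * bandwidth / (bandwidth ** 2 + omega ** 2)


def j_min(eps1, mean_v, spectrum, omega_max=200.0, steps=20000):
    """Trapezoidal estimate of the minimum mean-square difference over |omega| <= omega_max."""
    step = 2.0 * omega_max / steps
    total = 0.0
    for i in range(steps + 1):
        omega = -omega_max + i * step
        s = spectrum(omega)
        value = eps1 * mean_v * s / (s + eps1 * mean_v)
        weight = 0.5 if i in (0, steps) else 1.0   # end points get half weight
        total += weight * value
    return total * step / (2.0 * math.pi)


if __name__ == "__main__":
    for eps1 in (1.0, 0.1, 0.01):
        print(eps1, j_min(eps1, 1.0, lorentzian))
```

For this particular spectrum the untruncated integral has the closed form $\sqrt{a/(2 + a)}$ with $a = \epsilon_1 E(V)$, so the decrease toward zero is only as fast as $\sqrt{\epsilon_1}$.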

V. Conclusions and Applications 

We have investigated some aspects of the relationship 
between a PSN process and its intensity, when the latter 
is a sample function of a continuous nonnegative real 
stochastic process. In particular, we examined the dif- 
ference of the moments and the mean-square difference 
between the two processes. The continuity assumption on 
the intensity aided us in developing a sequence of moment 
relations that manifest the effect of the component 
function of the PSN. These results simplified, and afforded 
some degree of physical interpretation, when the com- 
ponent functions were taken as rectangles, or when the 
intensity did not vary appreciably over their time width. 
It has also been shown that a degenerate form of compo- 
nent function actually has meaning, and corresponds to 
the exact representation of the intensity by the PSN 
process. A principal conclusion of the paper is that a 
continuous nonnegative real process $v(t)$ can be “approximately” 
modeled by a PSN process by allowing $v(t)$ to be 
the intensity of the PSN, and properly assigning the 
component function. The results of the paper can be used 
for assessing the degree of approximation. 

One particular application is in the field of optical 
communications, where photodetector outputs are modeled 
as PSN processes. In this case, the component function 
corresponds to a current “pulse” induced by the arriving 
photons. These current pulses have time widths inversely 
proportional to the detection bandwidth, on the order 
of $10^{-9}$ second. The intensity modulation has bandwidths 
on the order of $10^3$-$10^8$ Hz. Thus, the assumption of little 
intensity variation over the time width of a component 
function is appropriate in this application. The average 
photon intensity of a coherent optical signal is given by 
$P/hf$, where $P$ = the average transmitter power, $h$ = 
Planck’s constant, and $f$ = frequency of the optical mode. 
The condition of the denseness of the shot noise (19) is 
therefore equivalent to the statement that P/hfB be 
sufficiently large, where B is the detector bandwidth. 
Since hf is often considered the intrinsic quantum noise 
spectral level, the parameter P/hfB appears as a power 
signal-to-noise ratio of the optical radiation. Hence, the 
detector output PSN process models the stochastic 
intensity modulation if the quantum signal-to-noise ratio 
is sufficiently high. 
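
The quantum signal-to-noise ratio $P/hfB$ is a one-line calculation. The sketch below is illustrative: the function name and the sample power, wavelength, and bandwidth are assumptions, not values from the paper.

```python
PLANCK = 6.62607015e-34      # Planck's constant, J s
LIGHT_SPEED = 2.99792458e8   # speed of light, m/s


def quantum_snr(power_w, wavelength_m, bandwidth_hz):
    """P / (h f B): average number of photons arriving per detector pulse width."""
    frequency = LIGHT_SPEED / wavelength_m
    return power_w / (PLANCK * frequency * bandwidth_hz)


if __name__ == "__main__":
    # Hypothetical link: 1 nW received at 1 micron with a 1 GHz detector bandwidth.
    print(quantum_snr(1e-9, 1e-6, 1e9))
```

Under these assumed numbers the ratio is only about 5, so by the rule in (19) just the lowest-order moments of the intensity would be well represented; raising the received power or narrowing the detector bandwidth increases the ratio proportionally.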


Appendix 

The expansion of $J$ in (23) is 

$$J = E\Big[v^2(t) - 2v(t)\sum_{m=1}^{p} h(t - t_m) + \sum_{m=1}^{p}\sum_{k=1}^{p} h(t - t_m)\,h(t - t_k)\Big] \tag{30}$$

where we simplified $P(-\infty, t)$ to $p$. The expectation of 
the second term can be written 

$$E\Big[v(t)\sum_{m=1}^{p} h(t - t_m)\Big] = E_{V,N}\Big\{v(t)\,E_{p,t_m}\Big[\sum_{m=1}^{p} h(t - t_m) \,\Big|\, N, V\Big]\Big\} \tag{31}$$

where we have denoted by subscripts the variables over 
which we are averaging. By using (1) and (3), we have 

$$E_{p,t_m}\Big[\sum_{m=1}^{p} h(t - t_m) \,\Big|\, N\Big] = E(p)\int h(t - t_m)\,p_{t_m}(t_m)\,dt_m = \int h(t - t_m)\,n(t_m)\,dt_m. \tag{32}$$

Substituting into (31) then yields 

$$E\Big[v(t)\sum_{m=1}^{p} h(t - t_m)\Big] = \int h(t - t_m)\,E[v(t)\,n(t_m)]\,dt_m. \tag{33}$$

The third term in (30) contains terms for which $m = k$ 
and $m \ne k$. For the former 

$$E\Big[\sum_{m=1}^{p} h^2(t - t_m)\Big] = E_N\Big\{E\Big[\sum_{m=1}^{p} h^2(t - t_m) \,\Big|\, N\Big]\Big\} = E_N\int h^2(t - t_m)\,n(t_m)\,dt_m = E(N)\int h^2(t - t_m)\,dt_m. \tag{34}$$

For the $m \ne k$ terms, we have, by similar steps, 

$$E\Big[\sum_{m \ne k}\sum h(t - t_m)\,h(t - t_k)\Big] = E_N\Big\{E(p^2 - p)\iint h(t - t_m)\,h(t - t_k)\,p_{t_m}(t_m)\,p_{t_k}(t_k)\,dt_m\,dt_k\Big\} = E_N\iint h(t - t_m)\,h(t - t_k)\,n(t_m)\,n(t_k)\,dt_m\,dt_k \tag{35}$$

where we used the fact that $E(p^2 - p)/E^2(p) = 1$ for a 
Poisson random variable. The substitution of (33), (34), 
and (35) in (30) then yields (24). 




References 


[1] S. O. Rice, “Mathematical analysis of random noise,” Bell Sys. 
Tech. J., vol. 23, pp. 282-332, 1944. 

[2] E. Parzen, Stochastic Processes. San Francisco: Holden-Day, 
1962, p. 113. 

[3] D. Middleton, Introduction to Statistical Communication Theory. 
New York: McGraw-Hill, 1960, ch. 11. 

[4] D. Middleton, “A statistical theory of reverberation and similar 
first-order scattered fields, Parts I and II,” IEEE Trans. Information 
Theory, vol. IT-13, pp. 372-414, July 1967. 

[5] S. Karp, R. Gagliardi, and I. Reed, “Radiation models using 
discrete radiation ensembles,” Proc. IEEE, vol. 56, pp. 1704- 
1711, October 1968. 

[6] R. Franklin, A Treatise on Advanced Calculus. New York: 
Wiley, 195S, ch. 4. 

[7] H. Cramer, Mathematical Methods of Statistics. Princeton, 
N. J.: Princeton University Press, pp. 185-187. 

[8] Ibid. pp. 174-179. 






Reprinted by permission from IEEE TRANSACTIONS ON INFORMATION THEORY 
Vol. IT-18, No. 1, January 1972, pp. 208-211 
Copyright 1972 by The Institute of Electrical and Electronics Engineers, Inc. 
PRINTED IN THE U.S.A. 


Photon Counting and Laguerre Detection 

ROBERT M. GAGLIARDI 


Abstract — In this correspondence maximum-likelihood binary detec- 
tion theory is applied to an incoherent optical system model employing 
photodetectors governed by Laguerre counting statistics. It is shown 
that a maximum-likelihood Laguerre detector corresponds to a count 
comparison over each signaling interval. Laguerre error probabilities are 
presented and compared with those for Poisson counting. 

Introduction 

In optical communication systems binary data bits are often transmitted 
by sending light in one of two possible adjacent time intervals. 
When incoherent photodetection is used in each interval, the receiver is 
modeled as a counter of photons. The synthesis of the optimal receiver 
processing and its resulting performance therefore depend upon the 
statistics associated with this counting. In early work, the probability 
density of the counts was almost exclusively assumed to be Poisson, 
and the optimum receiver processing and performance [1]-[3], [8, p. 
207] were determined. Count statistics are in fact only conditionally 


Manuscript received March 5, 1971 ; revised May 20, 1971. This work was sponsored 
by the National Aeronautics and Space Administration, under NASA Contract 
NGR-05-018-104. 

The author is with the Department of Electrical Engineering, University of Southern 
California, Los Angeles, Calif. 90007. 




Poisson (e.g., see [4, pt. 1]), and the probability of $k$ photoelectron 
counts occurring in a single spatial mode during a $(0,T)$ s counting 
interval is given by 

$$P(k) = \int_0^{\infty} \frac{x^k}{k!}\,e^{-x}\,p(x)\,dx, \tag{1}$$

where $p(x)$ is the probability density of the random energy variable 

$$x = \int_0^T |f(t)|^2\,dt$$

and $f(t)$ = detected optical field in a spatial mode at the photodetector. 
When the detected field is the sum of a deterministic signal 
field and Gaussian white, band-limited thermal noise of temperature 
$\mu$ and bandwidth $B$, and if the noise energy per each significant space-time 
mode is assumed equal, (1) is known to be [5] 

$$P_L(k; E,N,D) = \frac{N^k}{(1 + N)^{D+k+1}} \exp\left[-\frac{E}{1 + N}\right] L_k^D\left[\frac{-E}{N(1 + N)}\right]. \tag{2}$$

Here $N$ is the average number of noise counts per time-space mode, 
given by Planck's formula $N = [\exp(hf/k\mu) - 1]^{-1}$, where $h$ is 
Planck’s constant, $k$ is Boltzmann’s constant, and $f$ is the optical carrier 
frequency. The parameter $E$ is the average number of signal counts 
[signal energy over $(0,T)$ divided by $hf$], $L_k^D(\cdot)$ is the Laguerre polynomial 

$$L_k^D(x) = \sum_{j=0}^{k} \binom{k + D}{k - j} \frac{(-x)^j}{j!} \tag{3}$$

and $D = 2BT$ is the time-bandwidth product, often called the count 
dimension. Physically, $D + 1$ is the number of temporal modes 
observed in a single spatial mode during the count interval. The 
probability in (2) is called a Laguerre probability, and we refer to the 
associated count statistics as Laguerre counting. Our objective now is 
the application of maximum-likelihood detection theory to an optical 
digital system governed by Laguerre counting. Earlier work in this area 
by Liu [9] and Helstrom [10] dealt with the case $D = 0$. 

We may first digress to examine the conditions under which Laguerre 
counting can be replaced by Poisson counting. It has been shown [5] 
that if $N \to 0$, $D \to \infty$, in such a way that $DN$ remains fixed, the 
Laguerre probability in (2) is asymptotic to the Poisson probability 

$$P_p(k; E,DN) = \frac{(E + DN)^k}{k!} \exp[-(E + DN)] \tag{4}$$

at every $k$. In a practical situation the condition $N \ll 1$ is generally true, 
since $N$ is on the order of $10^{-7}$ to $10^{-6}$ for typical background noise 
sources and visible wavelengths. However, the dimension $D$ depends 
upon the data rate being transmitted. A question then arises as to how 
large $D$ should be in order to replace the Laguerre probability by the 
Poisson in analysis. A first-order condition can be determined by noting 
that if $N \ll 1$, the first two factors in (2) are, to a good approximation, 

$$\left(\frac{1}{1 + N}\right)^{D+1} \left(\frac{N}{1 + N}\right)^{k} \exp[-E/(1 + N)] \approx N^k \exp[-(E + DN)]. \tag{5}$$

The Laguerre term in (3) can be rewritten by applying the asymptotic 
relation for the ratio of gamma functions [6, p. 15] 

$$\frac{(D + k)!}{(D + j)!} = D^{k-j}\left[1 + \frac{(k - j)(k + j - 1)}{2D}\right] + O(|D|^{-1}). \tag{6}$$

This allows the Laguerre polynomial, for $N \ll 1$, to be written as 

$$L_k^D\left[\frac{-E}{N(1 + N)}\right] \approx \sum_{j=0}^{k} \frac{D^{k-j}}{(k - j)!\,j!} \left(\frac{E}{N}\right)^j \left[1 + \frac{(k - j)(k + j - 1)}{2D}\right] + O(|D|^{-2}). \tag{7}$$

When (5) and (7) are substituted into (2), the Laguerre probability 
takes the form 

$$P_L(k; E,N,D) = P_p(k; E,DN) + C(k,D,E,N) + O(|D|^{-2}). \tag{8}$$

Here the function $C$ appears as a first-order correction term to the 
Poisson probability and is due to the sum term in (7). If we apply the 
fact that 

$$\frac{(k - j)(k + j - 1)}{2D} \le \frac{k^2 + k}{2D}, \qquad j \ge 0 \tag{9}$$

to (7), we can bound the correction term $C$ in (8) by 

$$C(k,D,E,N) \le \frac{k^2 + k}{2D}\,P_p(k; E,DN). \tag{10}$$

Thus a bound can be placed on the first-order difference between 
Laguerre and Poisson probabilities. It implies that the probabilities 
may differ significantly over the tails, i.e., for large $k$. However, the 
form of (10) does allow a rough rule for bounding $D$. If, for example, 
the difference between Laguerre and Poisson probabilities is to be 
within a fraction $\gamma$ of the Poisson value over a range of $k$, say $k \le k_0$, 
then $D$ should at least satisfy 

$$D \ge (k_0^2 + k_0)/2\gamma. \tag{11}$$

If $D$ does not satisfy the above, then with certainty $(P_L - P_p)/P_p > \gamma$ 
for some $k \le k_0$. If $D$ satisfies (11), the effect of higher order correction 
terms should be considered, although for $D \gg 1$, the first-order 
correction will predominate. (For example, a megabit system operating 
at 10 μ with a 1-Å optical filter will generate a $D$ of about 400.) 

Maximum-Likelihood Laguerre Detection 

Consider an optical system in which two adjacent time intervals are 
used for binary signaling. Assume optical signals of equal energy are 
transmitted, additive background noise of constant energy level is 
encountered, and identical dimensions $D$ exist during each counting 
interval. Let a binary one be represented by signal energy $Ehf$ in the 
first interval and let a binary zero be represented by energy $Ehf$ in the 
second interval. The receiver photodetects the received field (transmitted 
bit signal plus background noise) over each interval, producing 
photoelectron counts obeying the Laguerre statistics in (2). A decoder 
follows the photodetector and performs a maximum-likelihood test for 
deciding between the hypothesis of a binary one being received, $H_1$, or 
a binary zero, $H_0$. If $k_i$ is the count over interval $i$, the decision is 
therefore based upon the observed count vector $\mathbf{k} = (k_1, k_2)$. If 
$p(\mathbf{k} \mid H_i)$ is the probability of $\mathbf{k}$ occurring when $H_i$ is true, then the 
maximum-likelihood test corresponds to the decision rule 

$$p(\mathbf{k} \mid H_1) \underset{H_0}{\overset{H_1}{\gtrless}} p(\mathbf{k} \mid H_0) \tag{12}$$

while an equilikely random decision is made when the $p(\mathbf{k} \mid H_i)$ are 
equal. For Laguerre counting, this corresponds to a comparison of the 
densities 

$$p(\mathbf{k} \mid H_1) = P_L(k_1; E,N,D)\,P_L(k_2; 0,N,D) \tag{13}$$

and 

$$p(\mathbf{k} \mid H_0) = P_L(k_1; 0,N,D)\,P_L(k_2; E,N,D). \tag{14}$$

Substituting from (2) and canceling common terms for a given $\mathbf{k}$ 
yield the equivalent comparison 

$$L_{k_1}^D(-e)\,L_{k_2}^D(0) \gtrless L_{k_1}^D(0)\,L_{k_2}^D(-e) \tag{15}$$

or 

$$\frac{L_{k_1}^D(-e)}{L_{k_2}^D(-e)} \gtrless \frac{L_{k_1}^D(0)}{L_{k_2}^D(0)}, \tag{16}$$

where $e = E/N(1 + N)$. When $k_1 > k_2$ the function $L_{k_1}^D(-e)/L_{k_2}^D(-e)$ 
is monotonically increasing in $e$, which guarantees that the 
left side of (16) exceeds the right side. Similarly, when $k_1 < k_2$, the 
converse is true. This means the maximum-likelihood test in (12) is 


210 


IEEE TRANSACTIONS ON INFORMATION THEORY, JANUARY 1972 


equivalent to the test 

$$\text{decide } H_1\ (H_0) \text{ if } k_1 > (<)\ k_2,$$

$$\text{decide } H_1 \text{ with probability } \tfrac{1}{2} \text{ if } k_1 = k_2. \tag{17}$$

Thus, maximum-likelihood decoding with Laguerre counting requires 
only a count comparison over each interval. 
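The count comparison in (17) can be sketched in a few lines. This is only an illustration: the function name `ml_decide` and the use of Python's `random` module for the tie-break are our own choices, not part of the original analysis.

```python
import random

def ml_decide(k1, k2, rng=random):
    """Decide the transmitted bit from the two interval counts, as in (17).

    k1, k2 are the photoelectron counts in the first and second interval.
    Returns 1 (pulse judged to be in the first interval) or 0.
    """
    if k1 > k2:
        return 1
    if k1 < k2:
        return 0
    # Equal counts: equilikely random decision.
    return 1 if rng.random() < 0.5 else 0
```

Note that the rule needs no knowledge of E, N, or D, which is the practical appeal of the result.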

When the maximum-likelihood test is implemented, the correspond- 
ing Laguerre error probability is given by 

$$PE_L(E,N,D) = 1 - A \sum_{k_1=0}^{\infty} B^{k_1} L_{k_1}^D(-e) \sum_{k_2=0}^{k_1-1} \binom{D+k_2}{k_2} B^{k_2} - \frac{A}{2} \sum_{k=0}^{\infty} B^{2k} L_k^D(-e) \binom{D+k}{k}, \tag{18}$$

where $A = (1 + N)^{-2(D+1)} \exp[-E/(1 + N)]$ and $B = N/(1 + N)$.
The probability in (18) has been computed for several values of the
parameters. Some typical plots are shown in Fig. 1 as a function of
signal count E and crossplotted in Fig. 2 as a function of the counting
dimension D. Fig. 2 is particularly useful for demonstrating the
advantage of using narrow energy pulses (reducing T while keeping E
fixed for a given optical bandwidth B) and thereby reducing error
probability. This, of course, is simply a reiteration of the obvious fact
that one should encounter as few noise modes as possible, while the
signal energy should occupy the full available optical bandwidth.
However, it can be seen that the advantage gained in PE is relatively
small at low noise levels.

If Poisson counting had been assumed in each interval, with the same 
signal and total noise energies, the count probability in (4) would be 
used, and the error probability would instead be 


$$PE_p(E, DN) = \sum_{k_1=0}^{\infty} \sum_{k_2=k_1}^{\infty} \frac{(DN)^{k_2}(E + DN)^{k_1}}{k_1!\,k_2!} \exp[-(E + 2DN)] - \frac{1}{2} \sum_{k=0}^{\infty} \frac{[(E + DN)DN]^{k}}{(k!)^2} \exp[-(E + 2DN)]. \tag{19}$$


The error probability in (19) is easier to compute and parameter studies
have been extensively published [3], [7], [8, p. 215]. Some Poisson
results have been superimposed in Figs. 1 and 2 to illustrate the
difference between true error probabilities (Laguerre) and approximate
error probabilities (Poisson). Two conclusions are immediately
evident. First, the Poisson error probabilities are universally lower (more
optimistic) than the corresponding Laguerre probabilities. Second,
when $N \ll 1$, $PE_p$ yields a fairly accurate approximation to $PE_L$, even
if the dimension D is not particularly large. This fact appears to
indicate that the discrepancies between $P_L(k)$ and $P_p(k)$ over the tails
of the densities have little effect on error probability when the noise per
mode is small. This is further emphasized in Fig. 3 in which E and DN
are held fixed, while $PE_L$ is plotted as a function of D. The $PE_p$ value
for the same E and DN is shown as an asymptote. The $PE_L$ curve
approaches $PE_p$ asymptotically from above as D increases, and the
corresponding N decreases. In other words, for fixed signal and noise
energies, $PE_L$ is accurately predicted by $PE_p$ if the noise energy is
produced from a relatively low N value.
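The comparison between the two error probabilities can be reproduced numerically. The sketch below is a minimal illustration, not the computation used for the figures: it assumes the standard form of the Laguerre counting density, $P_L(k; E, N, D) = N^k (1+N)^{-(D+k+1)} \exp[-E/(1+N)]\, L_k^D[-E/N(1+N)]$ with $N > 0$ and integer $D \ge 0$, evaluates the error probability by direct summation truncated at an assumed `kmax`, and uses function names of our own choosing.

```python
import math

def laguerre_pmf(k, E, N, D):
    """Laguerre counting probability P_L(k; E, N, D); assumes N > 0, integer D >= 0."""
    log_pref = k * math.log(N) - (k + D + 1) * math.log1p(N) - E / (1.0 + N)
    e = E / (N * (1.0 + N))
    # L_k^D(-e) = sum_j C(k+D, k-j) e^j / j!  (all terms positive); work with logs.
    js = range(k + 1) if e > 0.0 else range(1)
    logs = [math.lgamma(k + D + 1) - math.lgamma(k - j + 1)
            - math.lgamma(D + j + 1) - math.lgamma(j + 1)
            + (j * math.log(e) if j else 0.0) for j in js]
    top = max(logs)
    return math.exp(log_pref + top) * sum(math.exp(v - top) for v in logs)

def poisson_pmf(k, mean):
    """Poisson probability of count k for the given mean count."""
    if mean <= 0.0:
        return 1.0 if k == 0 else 0.0
    return math.exp(k * math.log(mean) - mean - math.lgamma(k + 1))

def ppm_error(signal_pmf, noise_pmf, kmax=200):
    """Pr[noise count > signal count] + 0.5 Pr[counts equal], sums truncated at kmax."""
    pe, noise_cdf = 0.0, 0.0
    for k in range(kmax + 1):
        pn = noise_pmf(k)
        noise_cdf += pn                      # Pr[noise count <= k]
        pe += signal_pmf(k) * (max(0.0, 1.0 - noise_cdf) + 0.5 * pn)
    return pe

# Illustrative parameter values (our choice, not taken from the figures).
E, N, D = 5.0, 0.1, 4
pe_l = ppm_error(lambda k: laguerre_pmf(k, E, N, D),
                 lambda k: laguerre_pmf(k, 0.0, N, D))
pe_p = ppm_error(lambda k: poisson_pmf(k, E + D * N),
                 lambda k: poisson_pmf(k, D * N))
```

Working in the log domain avoids the overflow that a direct evaluation of $L_k^D(-e)$ would cause when $N$ is small and $e$ is correspondingly large.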

The magnitude of the difference between $PE_L$ and $PE_p$ depends upon
the actual values of D, E, and N. This can be seen by investigating the
behavior of the two functions at D = 0, which is the point at which the
largest difference occurs. When D = 0, $PE_p(E, 0) = \tfrac{1}{2} \exp(-E)$. The
corresponding Laguerre limit can be determined by noting that for
D = 0, (18) is

$$PE_L(E,N,0) = \left(\frac{1}{1+N}\right)^{2} \left(\frac{1+2N}{2}\right) \exp[-E/(1 + N)] \sum_{k=0}^{\infty} B^{2k} L_k^0(-e). \tag{20}$$

By applying a Laguerre identity [11, eq. (8.975)], and manipulating
algebraically, the above becomes $PE_L(E,N,0) = \tfrac{1}{2} \exp[-E/(1 + 2N)]$.
The ratio is then

$$\left.\frac{PE_L}{PE_p}\right|_{D=0} = \exp\left[\frac{2NE}{1 + 2N}\right]. \tag{21}$$

The above shows that the ratio of the two error rates depends explicitly
upon the EN product. When expressed in this manner, one can append
earlier statements and conclude that $PE_p$ and $PE_L$ are fairly close over
the range of all $D \ge 0$ if both $N \ll 1$ and $NE \ll 1$.
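As a quick numerical illustration of (21), the two D = 0 closed forms and their ratio can be evaluated directly. The function names below are ours, and the parameter values in the comments are arbitrary examples.

```python
import math

def pe_poisson_d0(E):
    """Poisson error probability at D = 0: (1/2) exp(-E)."""
    return 0.5 * math.exp(-E)

def pe_laguerre_d0(E, N):
    """Laguerre error probability at D = 0: (1/2) exp[-E / (1 + 2N)]."""
    return 0.5 * math.exp(-E / (1.0 + 2.0 * N))

def d0_ratio(E, N):
    """Ratio PE_L / PE_p at D = 0, as in (21)."""
    return math.exp(2.0 * N * E / (1.0 + 2.0 * N))

# With E = 10: the ratio stays close to one when N * E << 1 (e.g. N = 1e-4),
# and departs from one as N * E grows (e.g. N = 0.1).
close = d0_ratio(10.0, 1e-4)
apart = d0_ratio(10.0, 0.1)
```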

In arriving at the above results it should be pointed out that the
assumption of equal modal energy used in the derivation of (2) is
strictly satisfied for white noise only for D = 0 and $D \gg 1$. For
intermediate D values, say, 1 < D < 10, this assumption is definitely
violated, and the $PE_L$ points plotted using (2) for this range of D must
be accepted as an assumed logical extension of the more accurate
endpoints.


References 

[1] B. Reiffen and H. Sherman, "An optimum demodulator for Poisson processes,"
Proc. IEEE, vol. 51, Oct. 1963, pp. 1316-1320.

[2] K. Abend, "Optimum photo detection," IEEE Trans. Inform. Theory (Corresp.),
vol. 12, Jan. 1966, pp. 64-65.

[3] R. M. Gagliardi and S. Karp, "M-ary Poisson detection and optical communications,"
IEEE Trans. Commun. Technol., vol. COM-17, Apr. 1969, pp. 208-216.

[4] S. Karp, E. L. O'Neill, and R. M. Gagliardi, "Communication theory for the
free-space optical channel," Proc. IEEE, vol. 58, Oct. 1970, pp. 1611-1626.

[5] S. Karp and J. R. Clark, "Photon counting: A problem in classical noise theory,"
IEEE Trans. Inform. Theory, vol. IT-16, Nov. 1970, pp. 672-680.

[6] N. Lebedev, Special Functions and Applications (English transl.). Englewood,
N.J.: Prentice-Hall, 1965.

[7] S. Karp et al., "Error probabilities for Poisson detection," NASA Tech. Note
TN-D-4721, Oct. 1968.

[8] W. K. Pratt, Laser Communications. New York: Wiley, 1969.

[9] J. W. S. Liu, "Reliability of quantum-mechanical communication systems,"
IEEE Trans. Inform. Theory, vol. IT-16, May 1970, pp. 319-330.

[10] C. W. Helstrom, "Performance of an ideal quantum receiver of a coherent signal
of random phase," IEEE Trans. Aerosp. Electron. Syst. (Corresp.), vol. AES-5,
May 1969, pp. 562-564.

[11] I. Gradshteyn and I. Ryzhik, Tables of Integrals, Series and Products. New York:
Academic Press, 1965, p. 1037.







The Effect of Timing Errors in Optical Digital Systems 


ROBERT M. GAGLIARDI, member, ieee 


Abstract — The use of digital transmission with narrow light 
pulses appears attractive for data communications, but carries with
it a stringent requirement on system bit timing. The effects of
imperfect timing in direct-detection (noncoherent) optical binary
systems are investigated using both pulse-position modulation
(PPM) and on-off keying for bit transmission. Particular emphasis
is placed on specification of timing accuracy and an examination of
system degradation when this accuracy is not attained. Bit error
probabilities are shown as a function of timing errors from which
average error probabilities can be computed for specific
synchronization methods. Of significance is the presence of a residual or
irreducible error probability in both systems, due entirely to the timing
system, which cannot be overcome by the data channel.

I. Introduction 

THE ABILITY to generate extremely narrow high-energy
light pulses from a laser source has made
the optical transmission of digital data extremely
attractive for modern communications. This possibility
has fostered an exhaustive exploration of optical
communication systems, from both a theoretical and
hardware point of view (e.g., see [1]). The use of digital
transmission with narrow pulses, however, carries with
it an extremely stringent requirement on system bit
timing, i.e., time control of the system sampling and
integration intervals during each data bit. For the most
part, past analytical studies have assumed perfect system
timing, and the degradation caused by timing errors
in optical systems has been virtually ignored. In this
paper, we investigate the effects of imperfect timing in
a direct-detection (noncoherent) optical communication
system, with particular emphasis on the specification
of timing accuracy, and an examination of the system
degradation when this accuracy is not attained.

Paper approved by the Communication Theory Committee of
the IEEE Communications Society for publication without oral
presentation. This work was supported by NASA under Contract
NGR-05-018-104. Manuscript received September 7, 1971; revised
November 12, 1971.

The author is with the Department of Electrical Engineering,
University of Southern California, Los Angeles, Calif. 90007.

Consider a general optical digital system as shown
in Fig. 1(a). The system sends bits of information by
transmitting bursts of optical energy. One of two
possible methods is usually used for encoding the bits.
In one, the system operates by transmitting a burst of
energy in one of two T-second adjacent time intervals
to encode a binary bit. This represents a two-level
pulse-position modulated (PPM) mode of transmission and
is known to be optimal under various criteria, when
constrained in average transmitter power [2]. Thus,
for example, the binary sequence 0110 would be
transmitted by the optical waveform shown in Fig. 1(b),
where the pulse represents a burst of optical laser
energy. We have considered an energy pulse in the first
interval to represent a binary one, and an energy pulse
in the second interval to represent a binary zero. A
second procedure is to use on-off keying, in which the
transmitter uses an energy burst for a one, and
transmits no energy for a zero. Thus, the sequence 0110
would be transmitted by the energy waveform in Fig.
1(c). Note that if T is the energy pulsewidth, then in
PPM 2T is the bit interval and information is being
transmitted at a rate 1/2T bit/s, while in on-off keying
T is the bit interval and the rate is 1/T bit/s.
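The two encodings can be illustrated by listing which T-second slots carry an energy pulse. The following sketch uses hypothetical helper names (`ppm_slots`, `ook_slots`) and represents a pulse by 1 and an empty slot by 0.

```python
def ppm_slots(bits):
    """Two slots per bit: a one puts the pulse in the first slot, a zero in the second."""
    slots = []
    for b in bits:
        slots += [1, 0] if b == 1 else [0, 1]
    return slots

def ook_slots(bits):
    """One slot per bit: pulse for a one, no energy for a zero."""
    return [1 if b == 1 else 0 for b in bits]

sequence = [0, 1, 1, 0]
ppm = ppm_slots(sequence)   # occupies 8 slots (0 to 8T)
ook = ook_slots(sequence)   # occupies 4 slots (0 to 4T)
```

The slot counts make the rate difference explicit: PPM spends 2T per bit, on-off keying spends T.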



IEEE TRANSACTIONS ON COMMUNICATIONS, APRIL 1972 


[Figure 1: (a) block diagram with data bits, optical source, optical channel, optical receiver, and recovered bits; (b) PPM waveform over 0 to 8T; (c) on-off waveform over 0 to 4T.]

Fig. 1. (a) Digital optical system. (b) PPM energy waveform
for 0110. (c) On-off energy waveform for 0110.



Fig. 2. Optical receiver for digital systems. 


The digital receiver for the system is shown in Fig. 2. 
We shall assume that transmitter and receiver operate 
diffraction limited, so that the transmitted energy cor- 
responds to optical energy in a single spatial mode of 
the optical beam. The received optical beam is photo- 
detected, and its output is integrated over a T-second 
interval. The start-stop timing for this integration is 
provided by a synchronizing subsystem. In PPM, the 
bit decoder makes a comparison of the integrator out- 
put after the first T-second interval of each bit period 
with that after the second T-second interval, deciding 
a one or zero accordingly. In on-off keying a threshold 
test is made at the end of each bit time T, the bit de- 
cision depending upon whether the threshold is ex- 
ceeded or not. The latter system requires accurate 
knowledge of the expected signal and noise energies 
in order to properly set the threshold, representing a 
serious disadvantage to on-off operation. 

If the output of the photodetector is modeled [3] as
a wide-band shot-noise process (detector bandwidth
$\gg 1/T$), then the integrator output after T seconds
of integration, beginning at time t, is proportional to the
shot-noise counting process $k(t, t + T)$, where

$$k(t_1, t_2) = \text{number of photoelectrons in } (t_1, t_2). \tag{1}$$

In stating that the integrator value is proportional to (1),
we have neglected additive circuit thermal noise, which
implies the use of high-gain ideal photomultipliers in the
photodetection operation. The counting process $k(\cdot, \cdot)$
of the photodetector shot noise is a random point process
over the nonnegative integers. For the reception of an
optical field over (0, T), with signal energy E and
additive white Gaussian background noise of bandwidth
$B_0$, the probability that the count value $k(0, T)$ equals
integer k is known to be [4]

$$\Pr[k(0, T) = k] = P_L(k; S, N_0, D) = \frac{N_0^{k}}{(1 + N_0)^{D+k+1}} \exp\left[\frac{-S}{1 + N_0}\right] L_k^D\left[\frac{-S}{(1 + N_0)N_0}\right] \tag{2}$$

where $S = GE/hf$ is the average signal count over
(0, T); $N_0 = G[\exp(hf/kT_c) - 1]^{-1}$ is the average
noise count per mode due to background at temperature
$T_c$; h is Planck's constant; f is the laser frequency;
$D = 2B_0T$; G is the photomultiplier gain; and $L_k^D(x)$ is
the Laguerre polynomial in x of order D and index k:

$$L_k^D(x) = \sum_{j=0}^{k} \binom{k + D}{k - j} \frac{(-x)^j}{j!}. \tag{3}$$



The parameter D is the count dimension or time-bandwidth
product. Physically, D + 1 is the number of
temporal modes observed during the T-second counting
interval. The density $P_L(k; S, N_0, D)$ is called a
Laguerre counting density and is exact for D = 0 and
$D \gg 1$, but is only approximate for $D \sim 1$. (This is
due to the fact that (2) requires equal eigenvalues in
the expansion of the energy function, which is only
approximately true for low values of D.) The received
average signal energy E over the time interval T can
also be written as $E = Q_s T$, where $Q_s$ is the received
average power. We then have, alternately,

$$S = (GQ_s/hf)T = \mu_s T \tag{4}$$

where $\mu_s$ is the average count per second (count rate)
due to the signal.

Under typical operating conditions, we generally have
$N_0 \ll 1$ and $D \gg 1$, and (2) asymptotically approaches
the Poisson density [4]

$$\Pr[k(0, T) = k] = P_p(k; S + N) = \frac{(S + N)^k}{k!} \exp[-(S + N)] \tag{5}$$

where $N = DN_0$ represents the total noise count in all
modes. (For visible wavelengths, $N_0$ is generally on the
order of $10^{-7}$ to $10^{-6}$ counts/mode. An optical system at
10 μ operating with a 1-Å optical filter and $T = 10^{-6}$ s
will generate a D of about 400.) Note that with the
Poisson assumption, the count probability depends only
upon the sum of the signal and noise count. That is,
the count statistics do not distinguish between the effect
of signal energy or noise energy, but are determined
solely by their cumulative energy.

II. Error Probabilities 

If we transmit a binary PPM signal with fixed signal 
energy in the signaling interval, then the probability 
of making a bit error is simply the probability that the 


GAGL1ARDI : TIMING ERRORS IN OPTICAL DIGITAL SYSTEMS 




count in the nonsignaling interval exceeds or equals
that of the signaling interval. (If the counts are equal,
an equally likely random choice is made concerning
that particular bit.) If we denote $k_i$ as the count in the
ith interval, i = 1, 2, of a bit, then the average error
probability PE is

$$PE = \tfrac{1}{2} \Pr[k_2 > k_1 \mid \text{one sent}] + \tfrac{1}{2} \Pr[k_1 > k_2 \mid \text{zero sent}] + \tfrac{1}{2}\left\{\tfrac{1}{2} \Pr[k_2 = k_1 \mid \text{one sent}] + \tfrac{1}{2} \Pr[k_1 = k_2 \mid \text{zero sent}]\right\}. \tag{6}$$

From the symmetry of the transmission method, some
of the terms above combine and the result simplifies.
Thus, for Laguerre counting, (6) becomes

$$PE_L(S, N_0, D) = \sum_{k_1=0}^{\infty} \sum_{k_2=k_1}^{\infty} \gamma_{k_2} P_L(k_1; S, N_0, D)\,P_L(k_2; 0, N_0, D) \tag{7}$$

where $\gamma_{k_2} = \tfrac{1}{2}$ for $k_2 = k_1$ and is one otherwise. (The
$\tfrac{1}{2}$ factor accounts for the effect of equal interval counts.)
Note that the error probability using Laguerre counting
$PE_L$ depends explicitly on the count dimension D
(time-bandwidth product). If the Poisson assumption is
applicable, the probabilities in (7) are replaced by those
of (5), and we have

$$PE_p(S, N) = \sum_{k_1=0}^{\infty} \sum_{k_2=k_1}^{\infty} \gamma_{k_2} P_p(k_1, S + N)\,P_p(k_2, N). \tag{8}$$

We see that the Poisson error probability $PE_p$ depends
upon the parameter D only through the total noise
count $N = DN_0$. The Poisson error probability is easier
to compute than that using Laguerre counting, and
parametric studies of (8) have been extensively
published [2], [5]. A typical plot of $PE_p$ is shown in Fig. 3
as a function of the signal count S. Some $PE_L$ points
obtained by computing (7) at the same total noise level
are superimposed. Further comparisons of Poisson and
Laguerre error probabilities, in terms of the parameters
involved, are discussed in [6]. The primary conclusion
is that at low noise levels ($N_0 \ll 1$), it can be conjectured
that $PE_L \approx PE_p$ for moderate ($D \sim 100$) dimensions.

When on-off keying is used and a threshold test is
made at the end of each pulse time T, an error is made
whenever the integrator value is on the incorrect side
of threshold. If K is the a priori selected threshold count
value, then the error probability becomes

$$PE_p(S, N) = \tfrac{1}{2} \sum_{k=0}^{K} \gamma_k P_p(k, S + N) + \tfrac{1}{2} \sum_{k=K}^{\infty} \gamma_k P_p(k, N) \tag{9}$$

where again $\gamma_k = \tfrac{1}{2}$ for k = K and is one otherwise.
For Laguerre statistics, the probabilities on the right
should be replaced by the $P_L$ terms in (2). For Poisson
counting the sums in (9) are cumulative Poisson
probabilities and are well tabulated (e.g., see [9]).



Fig. 3. Error probabilities versus signal count for PPM. N — 
noise count; D — count dimension. 


[Figure 4: timing diagram showing a bit interval, its adjacent bit intervals, and the offset counting interval.]

Fig. 4. Effect of timing error on counting interval in PPM. 

III. Timing-Error Effects in PPM 

The primary assumption in (8) is that the bit timing
is perfect and the decoder counts photoelectrons exactly
over the two T-second intervals that constitute a bit. If
a time offset of Δ seconds occurs during a bit period, due
to timing errors in synchronization lockup, then the
counting occurs over an offset interval. That is, the
decoder starts and stops counting over a T-second interval
that is displaced by Δ seconds from that containing the
bit information, as shown in Fig. 4. As a result only a
portion of the true signal energy is included in the signal
count, while some signal energy may contribute to the
count in the adjacent interval, causing intersymbol
interference in the form of energy spillover. The effect of
this interference depends upon the form of the adjacent
bit; i.e., whether it contains signal energy or not.
Assuming a positive timing offset (0 < Δ < T), the various
effects on the counting statistics are summarized in
Table I, where $\mu_s$ is the average signal count rate in (4).
If we let $S = \mu_s T$ be the average count over T due to
signal energy, and assume equiprobable bits, the error
probability for a positive timing error Δ, averaging
over all possibilities given in Table I, is then

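$$PE_p | \Delta = \tfrac{1}{2} \sum_{k_1=0}^{\infty} \sum_{k_2=k_1}^{\infty} \gamma_{k_2} P_p[k_1, S(1-\epsilon)+N]\,P_p[k_2, N+S\epsilon] + \tfrac{1}{4} \sum_{k_1=0}^{\infty} \sum_{k_2=k_1}^{\infty} \gamma_{k_2} P_p[k_1, S(1-\epsilon)+N]\,P_p[k_2, N] + \tfrac{1}{4} \sum_{k_2=0}^{\infty} \sum_{k_1=k_2}^{\infty} \gamma_{k_1} P_p[k_1, S\epsilon+N]\,P_p[k_2, S+N] \tag{10}$$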




TABLE I

Transmitted   Subsequent
Bit           Bit          Pr k(0, T) = k_1             Pr k(T, 2T) = k_2

1             0            P_p[k_1, μ_s(T - Δ) + N]     P_p[k_2, N]
1             1            P_p[k_1, μ_s(T - Δ) + N]     P_p[k_2, N + μ_sΔ]
0             1            P_p[k_1, μ_sΔ + N]           P_p[k_2, μ_sT + N]
0             0            P_p[k_1, μ_sΔ + N]           P_p[k_2, μ_s(T - Δ) + N]


where $\epsilon = \Delta/T$ is the fractional timing error. The
error probability for negative time shifts will be
identical to the preceding, when all possibilities are considered,
if we interpret $\epsilon = |\Delta|/T$ when Δ < 0. Note that if each
of the double-sum terms in (10) is compared to (8),
which assumed perfect timing, we can rewrite (10) as

$$PE_p | \Delta = \tfrac{1}{2} PE_p(S', N') + \tfrac{1}{4} PE_p(S'', N) + \tfrac{1}{4} PE_p(S'', N') \tag{11}$$

where

$$S' = S(1 - 2\epsilon) \tag{12a}$$

$$S'' = S(1 - \epsilon) \tag{12b}$$

$$N' = N + S\epsilon. \tag{12c}$$

Fig. 5. Error probability versus timing error in PPM. S - signal
count; N - noise count.

Thus, timing errors in PPM can be accounted for by 
merely reinterpreting the effective signal and noise count 
per T interval while assuming perfect timing. Note that 
the timing errors always act to reduce the effective 
signal energy, while increasing the effective noise, the 
overall result degrading the error probability. It is im- 
portant to realize that the fact that the spilled over 
signal energy appears as effective noise energy is in- 
trinsic in the Poisson assumption and is valid as long 
as (8) describes the error probability. 
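The reinterpretation above lends itself to a short numerical sketch: compute the perfect-timing Poisson error probability (8) by direct summation, then combine it at the effective counts of (12). The weights 1/2, 1/4, 1/4 used below are the ones obtained by averaging the four equiprobable rows of Table I; the function names, the truncation rule for the sums, and the restriction to fractional offsets between 0 and 0.5 are assumptions of this sketch.

```python
import math

def poisson_pmf(k, mean):
    """Poisson probability of count k for the given mean count."""
    if mean <= 0.0:
        return 1.0 if k == 0 else 0.0
    return math.exp(k * math.log(mean) - mean - math.lgamma(k + 1))

def ppm_error_poisson(S, N):
    """Perfect-timing PPM error probability (8): means S + N and N in the two intervals."""
    kmax = int(S + N + 12.0 * math.sqrt(S + N + 1.0) + 30)   # assumed truncation point
    pe, noise_cdf = 0.0, 0.0
    for k in range(kmax + 1):
        pn = poisson_pmf(k, N)
        noise_cdf += pn                                      # Pr[noise count <= k]
        pe += poisson_pmf(k, S + N) * (max(0.0, 1.0 - noise_cdf) + 0.5 * pn)
    return pe

def ppm_error_with_offset(S, N, eps):
    """PPM error probability for fractional timing offset eps = |offset| / T, 0 <= eps <= 0.5."""
    s1 = S * (1.0 - 2.0 * eps)    # S'  of (12a)
    s2 = S * (1.0 - eps)          # S'' of (12b)
    n1 = N + S * eps              # N'  of (12c)
    return (0.5 * ppm_error_poisson(s1, n1)
            + 0.25 * ppm_error_poisson(s2, N)
            + 0.25 * ppm_error_poisson(s2, n1))
```

Sweeping `eps` from 0 to 0.5 for fixed S and N traces out the kind of degradation curve discussed in the text.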

A plot of (11), obtained by digital computation, is
shown in Fig. 5 for positive or negative timing errors.
The results show a relatively fast increase in PE
(system degradation) as the offset |Δ| is increased. The
system is essentially ruined (PE ≈ 0.5) when ε ≈ 0.5
or when |Δ| ≈ T/2. This is the point where the effective
signal-to-noise ratios S'/N' and S''/N' are equal to or
less than unity.

A lower bound to the system performance as $S \to \infty$
is included, obtained by invoking the fact that at low
noise levels Poisson error probabilities and Laguerre
error probabilities with the same total noise are roughly
equal, as pointed out before. Since $PE_L$ monotonically
increases with the parameter D, the use of $PE_L$
at D = 0 will serve as a lower bound for error
probability. When the signal has count S and the additive
noise count is N, (7) with D = 0 is


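$$PE_L\big|_{D=0} = \left(\frac{1}{1+N}\right)^{2} \left(\frac{1+2N}{2}\right) \exp\left[\frac{-S}{1+N}\right] \sum_{k=0}^{\infty} \left(\frac{N}{1+N}\right)^{2k} L_k^0\left[\frac{-S}{N(1+N)}\right]. \tag{13}$$

By applying a Laguerre identity [8, eq. 8.975] and
manipulating algebraically, the above becomes

$$PE_L\big|_{D=0} = \tfrac{1}{2} \exp\left[\frac{-S}{1 + 2N}\right]. \tag{14}$$

If we substitute the effective S and N from (12) into
(14), and use this as a lower bound for each term in
(11), we have

$$PE_p | \Delta \approx PE_L | \Delta \ge PE_L | \Delta; D = 0 = \tfrac{1}{4} \exp\left[\frac{-S(1 - 2\epsilon)}{1 + 2N + 2\epsilon S}\right] + \tfrac{1}{8} \exp\left[\frac{-S(1 - \epsilon)}{1 + 2N}\right] + \tfrac{1}{8} \exp\left[\frac{-S(1 - \epsilon)}{1 + 2N + 2\epsilon S}\right]. \tag{15}$$

Now as $S \to \infty$,

$$\lim PE_p | \Delta \ge \tfrac{1}{4} \exp\left[-\frac{1 - 2\epsilon}{2\epsilon}\right] + \tfrac{1}{8} \exp\left[-\frac{1 - \epsilon}{2\epsilon}\right]. \tag{16}$$

The above lower bound depends only upon ε and is
plotted as the S = ∞ curve in Fig. 5. The result is
interesting in that it shows that even as $S \to \infty$, a relatively
sharp system degradation can still be expected. This
can be attributed again to the fact that timing errors
cause a portion of the signal energy to appear as noise
energy. Therefore, even though an "infinite" signal
energy is available, there is consequently an "infinite"
noise energy present whenever ε ≠ 0, the overall result
not dependent upon S at all as (16) illustrates.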
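The approach of the D = 0 bound to its signal-independent limit can be checked numerically. The sketch below evaluates the finite-S bound and its limit using the closed form $\tfrac{1}{2}\exp[-S/(1+2N)]$ at the effective counts, with weights 1/4, 1/8, 1/8 (each timing-case weight times the factor 1/2 of the closed form). The function names are ours, and the weights follow from averaging Table I under equiprobable bits.

```python
import math

def ppm_offset_lower_bound(S, N, eps):
    """D = 0 Laguerre lower bound on PE for fractional timing offset eps."""
    spill = 1.0 + 2.0 * N + 2.0 * eps * S      # 1 + 2N' with N' = N + S * eps
    clean = 1.0 + 2.0 * N
    return (0.25 * math.exp(-S * (1.0 - 2.0 * eps) / spill)
            + 0.125 * math.exp(-S * (1.0 - eps) / clean)
            + 0.125 * math.exp(-S * (1.0 - eps) / spill))

def ppm_offset_floor(eps):
    """Limit of the bound as S grows without limit; defined for 0 < eps <= 0.5."""
    if eps <= 0.0:
        return 0.0
    return (0.25 * math.exp(-(1.0 - 2.0 * eps) / (2.0 * eps))
            + 0.125 * math.exp(-(1.0 - eps) / (2.0 * eps)))
```

For any fixed nonzero `eps`, increasing S in `ppm_offset_lower_bound` drives the value toward `ppm_offset_floor(eps)` rather than toward zero, which is the residual effect described above.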

The behavior of PE p at different noise counts is shown 
in Fig. 6 for fixed value of S. Again, even with negligible 
background noise, the system degrades in a similar 
fashion with increasing timing error. 

IV. Timing Error Effects With On-Off Keying 

When on-off keyed data bits are transmitted and 
threshold tests are used for bit decisions at the decoder, 





Fig. 6. Error probability versus timing error in PPM. S — signal 
count; N — noise count. 


the effect of timing errors can be determined by a
procedure similar to the PPM case. The actual bit decisions
will be influenced by the adjacent bit (the subsequent
bit when Δ > 0, the former bit when Δ < 0), just as in
the previous case. If we consider the four possible
combinations of transmitted and adjacent bits, and the
associated error probability for each, the total error
probability when a threshold K is used and an offset Δ occurs
is then

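$$PE_p | \Delta = \underbrace{\tfrac{1}{4} \sum_{k=0}^{K} \gamma_k P_p[k, S + N]}_{11} + \underbrace{\tfrac{1}{4} \sum_{k=K}^{\infty} \gamma_k P_p[k, N]}_{00} + \underbrace{\tfrac{1}{4} \sum_{k=0}^{K} \gamma_k P_p[k, S(1 - \epsilon) + N]}_{10} + \underbrace{\tfrac{1}{4} \sum_{k=K}^{\infty} \gamma_k P_p[k, \epsilon S + N]}_{01} \tag{17}$$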

where again $\epsilon = |\Delta|/T$ and S, N are the received signal
and noise counts, respectively. The symbols below each
sum represent the combination of data bits causing the
corresponding error probability, with the left-hand bit
the transmitted bit and the other the adjacent bit.
Comparison of (17) with (9) allows us to write

$$PE_p | \Delta = \tfrac{1}{2} PE_p(S, N) + \tfrac{1}{2} PE_p(S', N') \tag{18}$$

where the terms on the right are error probabilities with
perfect timing, and S' and N' are defined in (12). We
again observe that timing-error effects can be interpreted
as degradations in signal energy and increases in noise
energy in a perfectly timed system. Note that timing
errors are exhibited only in the second term in (18), and
can be attributed to the last two terms in (17), where
the adjacent bit is opposite from the true bit. The error
probabilities in (18) depend upon the choice of threshold
K used for decisioning. For a given design value of
S and N, the threshold K that minimizes (9) can be
determined by differentiation, and shown to be

$$K = \frac{S}{\log(1 + S/N)}. \tag{19}$$

Fig. 7. Error probability versus timing error in on-off keying.
S - signal count; N - noise count.

With this threshold, (18) is plotted in Fig. 7 as a
function of timing offset for several values of S and N. The
curves manifest similar behavior as in PPM, except the
degradation is faster, and the curves exhibit crossovers.
That is, at small offsets increasing S decreases error
probability, but at larger offsets the opposite is true. An
examination of the sums of (17) will reveal that for
$N \ll 1$, $S \gg 1$ the first three terms tend to zero and the
resulting $PE_p | \Delta$ is directly attributable to the last
term; i.e., the error probability when a zero is sent and
the adjacent bit is a one. In the limit as $S \to \infty$, it
follows that even though $K \to \infty$ [see (19)], this latter
probability becomes exactly one for any ε ≠ 0. The
overall $PE_p | \Delta$ therefore becomes 0.25, and the result is
plotted as the S = ∞ curve in Fig. 7. The behavior of
all these curves can be directly attributed to the fact
that optimal on-off keying requires proper threshold
selection, and timing offsets cause changes in effective
signal and noise energies and, hence, suboptimal
operation. As these effective energies become widely different
from the design energies, the resulting system
performance is severely degraded.
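The limiting behavior described above can be observed with a small sketch. It assumes the threshold takes the likelihood-ratio form $K = S/\log(1 + S/N)$ for Poisson counts, fixes K at its design value, and evaluates the offset error probability as the equal-weight average of the perfect-timing result at (S, N) and at the effective counts $S(1 - 2\epsilon)$, $N + S\epsilon$. Function names and the handling of a noninteger K (no tie can occur) are our own choices.

```python
import math

def poisson_pmf(k, mean):
    """Poisson probability of count k for the given mean count."""
    if mean <= 0.0:
        return 1.0 if k == 0 else 0.0
    return math.exp(k * math.log(mean) - mean - math.lgamma(k + 1))

def ook_threshold(S, N):
    """Design threshold count; requires S > 0 and N > 0."""
    return S / math.log1p(S / N)

def ook_error(S, N, K):
    """Perfect-timing on-off error probability for threshold K (ties split equally)."""
    def below_and_tie(mean):
        below = sum(poisson_pmf(k, mean) for k in range(math.ceil(K)))   # Pr[k < K]
        tie = poisson_pmf(int(K), mean) if float(K).is_integer() else 0.0
        return below, tie
    b1, t1 = below_and_tie(S + N)            # a one was sent
    b0, t0 = below_and_tie(N)                # a zero was sent
    miss = b1 + 0.5 * t1
    false_alarm = 1.0 - b0 - 0.5 * t0
    return 0.5 * miss + 0.5 * false_alarm

def ook_error_with_offset(S, N, eps, K):
    """On-off error probability with fractional timing offset eps and fixed threshold K."""
    return 0.5 * ook_error(S, N, K) + 0.5 * ook_error(S * (1.0 - 2.0 * eps), N + S * eps, K)

# Illustrative values (our choice): large signal count, threshold set for perfect timing.
S, N = 400.0, 0.5
K = ook_threshold(S, N)
pe_offset = ook_error_with_offset(S, N, 0.3, K)
```

With these values the spilled-over energy alone exceeds the design threshold, so the zero-followed-by-one case is almost always decoded in error, and the total sits near 0.25.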

V. Random Timing Errors 

The timing error that does in fact occur during a bit
interval depends upon the synchronization subsystem
and its performance in maintaining time lock. This is
generally accomplished by tracking a transmitted sync
signal with a locally generated sync signal using a
feedback tracking loop for error control. The timing error Δ
is therefore the tracking error between the received and
locally generated sync signals, and in reality should be
considered as a random process in t. In typical operation,
however, the loop-tracking bandwidth is much less than
the bit frequency 1/T, and the assumption of a constant
timing error during a given bit interval is essentially
valid. The error is, however, random, and its statistics will
depend upon the tracking-loop model. When sinusoidal
sync signals at the bit frequency 1/T are used, and
tracking is accomplished by a phase-lock loop (PLL)
following photodetection, the steady-state probability density
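of Δ is given by

$$p(\Delta) = \frac{2\pi}{T}\, p_\varphi\left(\frac{2\pi\Delta}{T}\right) \tag{20}$$

where $p_\varphi(\varphi)$ is the density of the loop-tracking phase
error $\varphi$. This latter density has been investigated for a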
system using a separate optical channel (different optical
frequency) for transmitting the sync information [7].
When the sync channel is in quantum-limited operation
(high-gain photomultiplication and negligible
background energy), the steady-state probability density of
the phase error has been approximated by computer
solution of the Smoluchowski-Kolmogorov equation [10].
The latter equation is a nonlinear partial differential
equation for the probability density of an output random
process (in this case, the tracking error) of a dynamical
system (the phase-tracking loop) when forced by an
input random process (the photodetector output of the
sync channel). The results of this computer solution,
reported in [7], are shown in Fig. 8. The phase-error
density depends only upon the parameter

$$\alpha = \frac{\mu_{sc}}{2B_L} \tag{21}$$

where $B_L$ is the tracking-loop bandwidth and $\mu_{sc}$ is the
average count rate due to the sync signal, the latter
directly related to the received power in the sync
channel. The parameter α is therefore the average number
of sync-signal counts occurring in the time period $1/2B_L$.
The bandwidth $B_L$ must be selected large enough to
allow suitable dynamical tracking of the incoming sync
phase shifts (due to Doppler, range uncertainty, and
oscillator phase jitter). For α > 3, the phase densities
are, to a good approximation, given by

$$p_\varphi(\varphi) = \frac{\exp(\alpha \cos \varphi)}{2\pi I_0(\alpha)}, \qquad |\varphi| \le \pi \tag{22}$$

where $I_0(\alpha)$ is the modified (imaginary-argument) Bessel function
of order zero. Equation (22) may be recognized as the steady-state density
associated with tracking a sync tone in the presence of
additive Gaussian noise [11]. This implies that the optical
sync channel performs identically to the microwave sync
channel for reasonably high (α > 3) count rates.
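The high-count-rate phase-error density can be evaluated without special-function libraries. The sketch below assumes the form $\exp(\alpha \cos\varphi)/[2\pi I_0(\alpha)]$ on $|\varphi| \le \pi$ and computes $I_0$ from its power series; the function names and the series stopping rule are our own.

```python
import math

def bessel_i0(x, max_terms=500):
    """Modified Bessel function I_0(x) from the series sum_m (x/2)^(2m) / (m!)^2."""
    total, term = 1.0, 1.0
    for m in range(1, max_terms):
        term *= (x / (2.0 * m)) ** 2
        total += term
        if term < 1e-17 * total:      # remaining terms are negligible
            break
    return total

def phase_error_density(phi, alpha):
    """Steady-state density of the loop phase error for sync count parameter alpha."""
    if abs(phi) > math.pi:
        return 0.0
    return math.exp(alpha * math.cos(phi)) / (2.0 * math.pi * bessel_i0(alpha))
```

Larger values of `alpha` concentrate the density around zero phase error, which is the sense in which more sync power buys better timing.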

An average timing-error probability PE can be
computed by averaging the PE | Δ in Figs. 5 and 7 over the
random timing errors, using the density p(Δ) obtained
from (20). That is,

$$PE = \int [PE | \Delta]\, p(\Delta)\, d\Delta. \tag{23}$$





Fig. 8. Probability densities of tracking errors. 



Fig. 9. Average error probability versus signal count for PPM.
N - noise count; α - sync signal count.

The integral in (23) was evaluated using a point-by-point
integration of the densities in Fig. 8. The results
are plotted in Fig. 9 for PPM and in Fig. 10 for on-off
keying, showing PE as a function of transmitted data
signal count S, for a fixed background noise count N
and several signal counts α in the sync channel. The
results indicate the average effect of imperfect timing,
exhibiting the usual falloff in error probability with
increasing signal energy, followed by a flattening (Fig. 9)
and bottoming (Fig. 10) of performance as S is increased.
The value of the minimum PE depends upon the tracking-loop
signal count. In PPM the minimum asymptotes
plotted in Fig. 9 are those obtained by averaging the
S = ∞ curve in Fig. 5 over the densities in Fig. 8. In
on-off keying PE actually begins increasing after achieving
a minimum value, even though S continues to increase.





Fig. 10. Average error probability versus signal count for on-off
keying. N - noise count; α - sync signal count.

This is due to the fact that the system is more
"mismatched" in threshold design at the higher values of S.
This latter fact tends to favor PPM operation over
on-off keying when combating imperfectly timed systems.
This bottoming of PE in both systems is extremely
important since it represents a residual nonreducible
error probability that depends only upon the sync system,
and cannot be overcome by increasing the bit energy
to the data signal. For example, we see from Fig. 9 that
with α = 5 and N = 0.5 we can never achieve an error
probability less than $2 \times 10^{-3}$, no matter how much
pulse energy we transmit. To determine these residual
values for other design parameters, the curves for PE | Δ
must be first generated then averaged as in (23).
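A rough estimate of the residual PPM error probability can be sketched by averaging an infinite-signal curve over the phase-error density. Several assumptions are made here and should be read as such: the S = ∞ curve is taken to be the D = 0 lower bound $\tfrac{1}{4}\exp[-(1-2\epsilon)/2\epsilon] + \tfrac{1}{8}\exp[-(1-\epsilon)/2\epsilon]$, the phase-error density is the high-count-rate form $\exp(\alpha\cos\varphi)/[2\pi I_0(\alpha)]$, and the timing offset is related to the phase error by $\epsilon = |\varphi|/2\pi$ (a sync tone at frequency 1/T). All function names are ours, and the midpoint rule stands in for the point-by-point integration mentioned in the text.

```python
import math

def bessel_i0(x, max_terms=500):
    """Modified Bessel function I_0(x) from its power series."""
    total, term = 1.0, 1.0
    for m in range(1, max_terms):
        term *= (x / (2.0 * m)) ** 2
        total += term
        if term < 1e-17 * total:
            break
    return total

def phase_error_density(phi, alpha):
    """Assumed phase-error density on |phi| <= pi."""
    return math.exp(alpha * math.cos(phi)) / (2.0 * math.pi * bessel_i0(alpha))

def ppm_offset_floor(eps):
    """Assumed S = infinity curve: D = 0 lower bound as a function of eps."""
    if eps <= 0.0:
        return 0.0
    return (0.25 * math.exp(-(1.0 - 2.0 * eps) / (2.0 * eps))
            + 0.125 * math.exp(-(1.0 - eps) / (2.0 * eps)))

def residual_error(alpha, steps=4000):
    """Average of the S = infinity curve over the phase-error density (midpoint rule)."""
    h = 2.0 * math.pi / steps
    total = 0.0
    for i in range(steps):
        phi = -math.pi + (i + 0.5) * h
        eps = abs(phi) / (2.0 * math.pi)       # assumed mapping from phase to offset
        total += ppm_offset_floor(eps) * phase_error_density(phi, alpha) * h
    return total
```

Because the averaged curve is a lower bound, the result is best read as an indication of how the residual level falls as the sync count α grows, not as a reproduction of the plotted asymptotes.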

It may be pointed out that the same residual effect
due to imperfect tracking occurs in the additive Gaussian
noise channel (microwave system instead of optical)
when using phase-shift keyed binary transmission. In
this latter case, as data signal energy becomes infinite,
$PE \to 0$ as long as the tracking error is less than π/2 rad,
and $PE \to 1$ for $\varphi > \pi/2$. Thus the residual error
probability is simply the probability that the loop error
exceeds π/2. In Fig. 5 we see that as $S \to \infty$ we do not
obtain zero PE (except at ε = 0), and the residual PE
tends to be higher than in the comparable microwave case.
That is, to obtain the same residual PE, the optical
system requires more sync power.


References 

[1] Proc. IEEE (Special Issue on Optical Communication), vol.
58, pp. 1407-1786, Oct. 1970.

[2] R. Gagliardi and S. Karp, “M-ary Poisson detection and
optical communications,” IEEE Trans. Commun. Technol.,
vol. COM-17, pp. 208-216, Apr. 1969.

[3] S. Karp, E. O’Neill, and R. Gagliardi, “Communication
theory for the free-space optical channel,” Proc. IEEE,
vol. 58, pp. 1611-1626, Oct. 1970.

[4] S. Karp and J. R. Clark, “Photon counting: A problem
in classical noise theory,” IEEE Trans. Inform. Theory,
vol. IT-16, pp. 672-680, Nov. 1970.

[5] W. K. Pratt, Laser Communications. New York: Wiley,
1969, ch. 9.

[6] R. Gagliardi, “Photon counting and Laguerre detection,”
IEEE Trans. Inform. Theory (Corresp.), vol. IT-18, pp.
208-211, Jan. 1972.

[7] M. Haney and R. Gagliardi, “Optical synchronization-Phase
locking with shot noise processes,” USCEE Rep. 396, Aug.
1970.

[8] I. Gradshteyn and I. Ryzhik, Tables of Integrals, Series,
and Products. New York: Academic Press, 1965.

[9] M. Abramowitz and I. Stegun, Eds., Handbook of Mathematical
Functions (Applied Mathematics Series 55).
Washington, D. C.: NBS, 1964, Table 26.

[10] D. Middleton, Introduction to Statistical Communication
Theory. New York: McGraw-Hill, 1960, ch. 10.

[11] A. Viterbi, Principles of Coherent Communication. New
York: McGraw-Hill, 1966, pp. 86-96.


Robert M. Gagliardi (S’57) received the B.S. degree in electrical
engineering from the University of Connecticut,
Storrs, in 1956, and the M.S. and Ph.D.
degrees in engineering from Yale University,
New Haven, Conn., in 1957 and 1960, respectively.

From 1958 to 1960 he was an Instructor
at the New Haven Engineering College. In
1960 he joined the Information Study Section,
Space System Division, Hughes Aircraft Company, Culver City,
Calif., where he was involved in problems in telemetry and
communication systems. He is presently an Associate Professor in the
Department of Electrical Engineering, University of Southern
California, Los Angeles, and a Consultant to the Hughes Aircraft
Company.

Dr. Gagliardi is a member of Eta Kappa Nu, Tau Beta Pi, and
Sigma Xi.


Reprinted by permission from
IEEE TRANSACTIONS ON COMMUNICATIONS
Vol. COM-22, No. 10, October 1974

Copyright © 1974, by the Institute of Electrical and Electronics Engineers, Inc.
PRINTED IN THE USA




Synchronization Using Pulse Edge Tracking in Optical 
Pulse-Position Modulated Communication Systems 

R. M. GAGLIARDI, MEMBER, IEEE

Abstract — A pulse-position modulated (PPM) optical communi- 
cation system using narrow pulses of light for data transmission 
requires accurate time synchronization between transmitter and 
receiver. The presence of signal energy in the form of optical 
pulses suggests the use of a pulse edge-tracking method of main- 
taining the necessary timing. In this report the edge-tracking 
operation in a binary PPM system is examined, taking into account
the quantum nature of the optical transmissions. Consideration is 
given first to “pure” synchronization using a periodic pulsed in- 
tensity, then extended to the case where position modulation is 
present and auxiliary bit decisioning is needed to aid the tracking 
operation. Performance analysis is made in terms of timing error
and its associated statistics. Timing error variances are shown as 
a function of system signal-to-noise ratio. 


I. INTRODUCTION 

The successful operation of any digital communication system 
requires accurate time synchronization between the transmitter and 
receiver. In optical digital systems a common procedure is to use a 
noncoherent pulse-position modulation (PPM) mode of operation 
using narrow pulses of light intensity to carry the data [1]. The 
presence of signal energy in the form of optical pulses suggests the 
use of a pulse edge-tracking method of maintaining the necessary 
time synchronization. In pulse edge tracking the edges of the trans- 
mitted pulses are used as timing markers to adjust the synchroniza- 
tion of the receiver. When the optical pulses are transmitted as a 
periodic pulse train of known fixed frequency, the edge tracking 
corresponds to “pure” synchronization, in that the transmitted 
edges always occur at periodic points in time. When position modulation
is present, however, the pulses of light are shifted according to
the data, and the edge-tracking operation must be modified in 
order to maintain receiver timing. The latter type of synchronization 
is often called modulation-derived synchronization, or “impure”
syncing, since the timing must be derived from, or accomplished 
in the presence of, the data modulation. In this paper we examine 
the pure and impure edge-tracking operation in an optical binary 
PPM system, taking into account the quantum nature of the light 
transmission. Performance comparisons are made in terms of the 
instantaneous timing error of the receiver and its associated statis- 
tics. The effect of imperfect timing on the overall data decoding 
operation has been studied elsewhere [2] and will not be considered 
here. 

The time synchronization problem has of course received consider- 
able attention in the past for the additive Gaussian noise channel, 
and the interested reader is referred to the presentations in recent 
books by Stiffler [3], Lindsey [4, ch. 3], and Lindsey and Simon 
[5]. Although the approach here parallels these earlier studies, the
quantum nature of the optical channel produces equations significantly
different from those of the purely Gaussian channel. Similar
mathematical differences were previously observed with the optical 


Paper approved by the Associate Editor for Communication Theory
of the IEEE Communications Society for publication without oral
presentation. This work was sponsored by the National Aeronautics
and Space Administration, under NASA Contract NGR-05-108-104.
This grant is part of the research program at NASA's Goddard Space
Flight Center, Greenbelt, Md. Manuscript received November 27, 1972;
revised June 5, 1974.

The author is with the Department of Electrical Engineering,
University of Southern California, Los Angeles, Calif. 90007.


IEEE TRANSACTIONS ON COMMUNICATIONS, OCTOBER 1974 


[Figure 1 block diagram; recoverable labels: optical field, decoded data.]

Fig. 1. An optical digital PPM receiver.


channel when considering detection [6], waveform estimation [7], 
and sinusoidal phase tracking [8]. The results here for pulse syn- 
chronization are reminiscent of (but not identical to) the effects of 
tracking in an additive shot-noise environment [9]. 

II. SYSTEM DESCRIPTION 

A block diagram of an optical digital system is shown in Fig. 1. In 
normal operation the incident optical field is photodetected, and the 
recovered signal is processed in both a data detection channel and 
in a synchronization channel, the latter providing the timing for 
the former. In this paper only the synchronization subsystem will 
be considered. In a PPM noncoherent mode of operation, digital 
information is transmitted by position modulating a pulse of light 
intensity during each digital word interval. Thus, in a binary system, 
the light energy is transmitted in one of two adjacent bit subinter- 
vals, representing a binary one or binary zero, as shown in Fig. 2 (a). 
Detection in the data channel is made by photoelectron counting 
(physically, short-term integration of the photodetector output, 
which is equivalent to energy detection of the optical field) during
each subinterval, deciding on the position with the highest count as 
containing the transmitted bit. Timing for the starting and stopping 
of each counting interval is provided by the synchronization sub- 
system, and timing errors (offsets between received and integrated 
bit intervals) lead directly to system degradation. Continual timing 
information is necessary in order to maintain bit timing in spite of 
the time delay variations that may occur during optical transmission. 
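
The count-comparison decision described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the receiver of Fig. 1: timing is taken as perfect, the counts in the two subintervals are modeled as independent Poisson variables, and ties are broken with a fair coin. The names `poisson`, `ppm_decide`, and `error_rate`, and all numerical values, are hypothetical.

```python
import math
import random

def poisson(mean, rng):
    """Draw one Poisson variate (Knuth's product method; fine for small means)."""
    limit = math.exp(-mean)
    k, prod = 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k

def ppm_decide(bit, signal_count, noise_count, rng):
    """Simulate one binary PPM bit with perfect timing and return the decided bit.

    A one places the pulse in the first subinterval, a zero in the second.
    """
    mean_first = noise_count + (signal_count if bit == 1 else 0.0)
    mean_second = noise_count + (signal_count if bit == 0 else 0.0)
    k1 = poisson(mean_first, rng)
    k2 = poisson(mean_second, rng)
    if k1 == k2:                      # tie: assumed fair-coin rule
        return rng.randint(0, 1)
    return 1 if k1 > k2 else 0

def error_rate(signal_count, noise_count, trials=20000, seed=1):
    """Estimate the bit error probability by direct simulation."""
    rng = random.Random(seed)
    errors = 0
    for _ in range(trials):
        bit = rng.randint(0, 1)
        errors += ppm_decide(bit, signal_count, noise_count, rng) != bit
    return errors / trials
```

Calling `error_rate(10.0, 0.5)` and `error_rate(2.0, 0.5)` gives a rough feel for how the error probability falls as the signal count per pulse grows; timing offsets, which are the subject of this paper, are deliberately left out of this sketch.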

The system of Fig. 1 can operate with one of two different syn- 
chronization formats. For pure sync operation, an unmodulated 
sync signal (herein considered as a periodic train of optical pulses 
at the bit-rate frequency), shown in Fig. 2(b), is transmitted intermittently
in place of the data to allow receiver lockup, and the
resulting timing is used to decode the subsequent data transmissions 
until the system is retimed with the next sync burst. In impure sync 
generation, the PPM data are transmitted continuously and the 
timing is extracted from the data. The first procedure allows pure 
synchronization but must sacrifice data during the timing operation. 
The second method allows uninterrupted data transmission, and is 
obviously the preferred method of operation, but requires modula- 
tion-derived synchronization. For this reason considerable interest 
exists in developing the latter system and in determining its achiev- 
able performance. 

In pulsed optical systems both the pure and impure sync systems 
can employ edge tracking for timing. An edge-tracking subsystem 
makes use of the fact that a pulse edge always occurs at the center 
of each bit interval (see Fig. 2) and can therefore be used to sync 
an identical pulse train at the receiver. The subsystem, to be de- 
scribed in Sections III and IV, employs a feedback loop to essentially 
measure timing errors between the received and receiver pulse edges, 
using the error to correct the latter signal. In pure synchronization 
the pulse edge at the center always corresponds to the trailing edge 
of a pulse. When PPM is present, however, the edge may represent 
either a leading or trailing edge, and this polarity must be determined
for successful loop operation. To accomplish this, the sync
subsystem in the modulated case employs an auxiliary decision- 
making loop that operates in conjunction with the edge-tracking 
loop (see Fig. 7). This auxiliary loop essentially decides which type 
of edge (i.e., which data bit) is being received, using the decision to 
augment a delayed version of the standard edge-tracking loop. 
Similar systems have been previously proposed [10].


[Figure 2 waveform sketches; recoverable labels: intensity level 2P, time-axis marks 0, W, 2W, binary one, binary zero.]

Fig. 2. Intensity waveforms. (a) PPM bit intensities. (b) Pure sync intensity.


Continual or updated timing is necessary to overcome the unintentional
variations in transmission delay due to Doppler, relative
receiver motion, etc. If the basic assumption is made that these delay
variations are slow relative to the optical pulsewidth, then their
only effect is to vary the time location of the optical pulse without
distorting its shape. Thus, if $I(t)$ represents the light intensity at
the receiver with no delay variations, and if $\tau_1$ is the time-varying
delay occurring, then the recovered field intensity is given by
$I(t - \tau_1)$. Here it is tacitly implied that $\tau_1$ is a function of $t$ which
changes slowly with respect to the pulsewidth of $I(t)$. Note that this
latter condition is equivalent to the assumption that the bandwidth
of $\tau_1$ is much smaller than the bandwidth of $I(t)$. The principal
objective of the edge-tracking loop is therefore to “track out” the
unintentional time variations of $\tau_1$ generated during the transmission
of the optical field.

III. EDGE TRACKING OF A PERIODIC PULSE TRAIN 
(PURE SYNC) 

In this section we first examine the edge-tracking operation when 
the received intensity corresponds to a periodic pulse train of light. 
This would represent the situation during pure synchronization 
operation when the data modulation is not present [actually, a
periodic pulse train at the bit frequency can be considered as a continuous
sequence of the binary one symbol in Fig. 2(a)]. The received
field intensity with no delay variations is therefore given by


CONCISE PAPERS 


$$I(t) = P[1 + p(t)] \qquad (1)$$

where $P$ is the average received field power per unit area, and $p(t)$
is the effective intensity modulation

$$p(t) = \begin{cases} +1, & 0 < t < W \\ -1, & W < t < T. \end{cases} \qquad (2)$$

Here $W$ is the pulsewidth and $T = 2W$ is the bit period. The above
intensity is assumed to be received with the delay variations $\tau_1$,
which is equivalent to replacing $p(t)$ by $p(t - \tau_1)$.

[Figure 3 block diagram; recoverable labels: sync input, dc, pulse edge integrator (W/2 to 3W/2), start of integration, e(t), timing marks.]

Fig. 3. Pulse edge-tracking subsystem.

The output of the photodetector in Fig. 1, operating over a single
spatial mode of the optical field, is known to be the shot-noise current
process [11]

$$i(t) = Ge \sum_{j=1}^{N(0,t)} h(t - t_j) \qquad (3)$$

where $h(t)$ is the detector impulse function, $e$ is the electron charge,
$G$ is the photomultiplication gain, $\{t_j\}$ are the random event times,
and $N(0,t)$ is the electron Poisson counting process [i.e., $N(0,t)$ is
the number of electrons occurring in the interval $(0,t)$]. The counting
process has its average value related to the received field intensity
$I(t)$ by

$$\operatorname{avg} N(0,t) = \beta \int_0^t [I(t) + k_n]\, dt \qquad (4)$$

where $\beta = \eta A/hf$, $A$ is the detector receiving area, $\eta$ is the receiver
efficiency parameter, $h$ is Planck’s constant, $f$ is the optical frequency,
and $\beta k_n$ is the average rate of arrival of background noise photons
over the detector area. The photodetector output $i(t)$ of (3) will
have added to it a white Gaussian circuit noise current $i_n(t)$, and
the resulting signal, $x(t) \triangleq i(t) + i_n(t)$, provides the input to the
synchronization edge-tracking subsystem, shown in detail in Fig. 3.
A pulse-edge integrator is time controlled by a receiver timing oscillator,
generating an error voltage used to readjust the oscillator.
The latter, in addition, provides the timing markers for the data
channel. The pulse-edge integrator consists simply of a $W$-second
integration offset over the trailing edge of the received pulse. If the
input to the integrator were the pulse train of Fig. 2(b) with zero
average value (i.e., with its dc value removed), the offset integration
would occur over portions of positive and negative values. If this
integration were timed to begin exactly halfway through the
positive pulse (at $t = W/2$), the resulting integrated error value
would be zero, no oscillator correction would be necessary, and the system
would be in time sync. If a time difference occurred between the start of
integration and the pulse half-interval point, a proportional error signal
would be generated whose polarity depended on the direction of the
time difference. This error voltage can be used to adjust the loop
timing oscillator. The input dc removal can be accomplished easily
by capacitor-coupled circuits, but is represented as a subtraction in
the mathematical model of Fig. 3. Unfortunately, in the optical
system of Fig. 1, the input to the loop is not a clean pulse sync train,
but rather the shot-noise process of (3), containing the optically
pulsed intensity of (1). In addition, this shot noise has added to it
the circuit white-noise current $i_n(t)$. Hence, the error signal
generated after the short-term loop integration is

$$e(t) = \frac{1}{W}\int_{\tau_2}^{\tau_2+W} [i(t) + i_n(t) - Ge\beta(P + k_n)]\, dt
= \frac{1}{W}\int_{\tau_2}^{\tau_2+W} i(t)\, dt + \frac{1}{W}\int_{\tau_2}^{\tau_2+W} i_n(t)\, dt - Ge\beta(P + k_n) \qquad (5)$$


where $\tau_2$ represents the start of the loop integration; i.e., the timing
of the loop. The subtraction in (5) represents the removal of the
average intensity from the loop input. The dependence of the right-hand
side on $t$ is implicit in the parameter $\tau_2$, which varies as the
loop attempts to track out the variations $\tau_1$. Although $\tau_2$ is actually a
function of $t$, it is treated as a constant when integrating over the
pulsewidth $W$ seconds long. The latter fact is simply a restatement of
the fact that the bandwidth of $\tau_2$, which is roughly the same as that
of $\tau_1$, is much less than the repetition frequency $1/W$. After substituting
the input processes, (5) can be rewritten as

$$e(t) = \frac{Ge}{W} N(\tau_2, \tau_2 + W) + n(\tau_2) - Ge\beta(P + k_n) \qquad (6)$$

where $n(\tau_2)$ is the Gaussian circuit noise random process, obtained
by integrating $i_n(t)$ over $(\tau_2, \tau_2 + W)$, and has zero mean and
variance $N_0/2W$, with $N_0$ the one-sided circuit noise spectral level.
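
A short Monte Carlo sketch can make the structure of (6) concrete. It is an illustration under several assumptions that are not part of the paper: units are normalized so that $Ge = 1$ and $W = 1$, the circuit noise term is omitted, and the timing error `tau` is measured from the in-lock gate position, so that the pulse (intensity $2P$) lights a fraction $1/2 + \tau/W$ of the gate. The helper names are hypothetical.

```python
import math
import random

def poisson(mean, rng):
    """Poisson variate by Knuth's product method (adequate for modest means)."""
    limit = math.exp(-mean)
    k, prod = 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k

def error_sample(tau, beta_p, beta_kn, rng, width=1.0):
    """One sample of e(t) in (6) with G*e = 1 and the circuit noise omitted.

    tau is the timing error measured from the in-lock gate position,
    restricted to |tau| <= width/2 (the linear range).
    """
    pulse_fraction = 0.5 + tau / width            # part of the gate lit by the pulse
    mean_count = width * (beta_kn + 2.0 * beta_p * pulse_fraction)
    count = poisson(mean_count, rng)
    return count / width - (beta_p + beta_kn)     # subtract the dc term of (6)

def mean_error(tau, beta_p, beta_kn, trials=20000, seed=3):
    """Average many samples to estimate the mean error voltage."""
    rng = random.Random(seed)
    return sum(error_sample(tau, beta_p, beta_kn, rng) for _ in range(trials)) / trials
```

With `beta_p = 20.0` and `beta_kn = 2.0`, `mean_error(0.1, 20.0, 2.0)` should come out close to $\beta P \cdot 2\tau/W = 4$, while individual samples scatter widely around that value because of the shot noise.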

The performance of the tracker can be directly related to the 
instantaneous timing error between the received and the oscillator 
signal. This timing error is defined by 

$$\tau \triangleq \tau_1 - \tau_2 \qquad (7)$$

where all parameters are actually functions of $t$. Using straightforward
analog loop analysis, and recalling that the oscillator phase
depends on the integral of the voltage controlling it, the timing error
$\tau$ in (7) satisfies the integral-differential equation

$$\frac{d\tau}{dt} = \frac{d\tau_1}{dt} - K e(t) \qquad (8)$$

where $K$ is the total loop gain. Since the error signal $e(t)$ depends
on both $\tau_1$ and $\tau_2$, the equation is in general nonlinear in $\tau$. Clearly,
the solution for $\tau(t)$ in (8) necessarily evolves as a stochastic process
due to the randomness of $e(t)$ in (6). This is true even if the additive
circuit noise $i_n(t)$ is set equal to zero (i.e., only background interference)
due to the randomness of the shot-noise process.

Although the probability densities of $\tau(t)$ will be of ultimate
interest, the behavior of the instantaneous mean value of $\tau(t)$ can
be derived from (8). If we statistically average both sides, interchanging
differentiation and averaging on the left, we obtain the
equation

$$\frac{d\bar{\tau}(t)}{dt} = \frac{d\bar{\tau}_1(t)}{dt} - K\bar{e}(t) \qquad (9)$$

where the overbar denotes statistical average. Since the additive
noise variable in (6) has zero mean, the mean-error voltage is given
by the mean shot-noise count. The latter is the integrated count
intensity over the integration intervals, as denoted by (4). Hence,

$$\bar{e}(t) = E_\tau\left\{\frac{Ge\beta}{W}\int_{\tau_2}^{\tau_2+W} [I(t) + k_n]\, dt - Ge\beta(P + k_n)\right\}
= E_\tau\left\{\frac{Ge\beta}{W}\int_{\tau_2}^{\tau_2+W} \big[P[1 + p(t - \tau_1)] + k_n\big]\, dt - Ge\beta(P + k_n)\right\} \qquad (10)$$

where $E_\tau$ is the expectation operator over the random variable $\tau$,
and $p(t)$ is given in (2). The above integral can be rewritten in terms
of a receiver correlation. Define the function $y(t)$ by

$$y(t) = \begin{cases} 1, & 0 < t < W \\ 0, & \text{elsewhere.} \end{cases} \qquad (11)$$

Then (10) becomes

$$\bar{e}(t) = E_\tau[e\beta GP\, R_{yp}(\tau)] \qquad (12)$$

where

$$R_{yp}(\tau) \triangleq \frac{1}{W}\int_{-\infty}^{\infty} p(t - \tau_1)\, y(t - \tau_2)\, dt. \qquad (13)$$

Hence, the mean of the error process can be related to the correlation
of the periodic intensity modulation $p(t)$ with the time function
$y(t)$. The latter function can therefore be considered as the receiver
“timing” signal produced by the loop. The correlation function
$R_{yp}(\tau)$ for the functions $p(t)$ in (2) and $y(t)$ in (11) is plotted in
Fig. 4. This correlation function is the mean-error function of the
tracking loop, and is often referred to as the loop “S curve.”

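
The shape of the S curve can be checked numerically. The sketch below is an illustration under one stated assumption: the $W$-second gate is taken to open at $t = W/2$ (the in-lock position described in the discussion of Fig. 3), so that `tau` is the offset from lock and the curve passes through zero at `tau = 0`. The function names and the midpoint-rule integration are illustrative choices.

```python
def p(t, width):
    """Intensity modulation of (2): +1 during the pulse, -1 otherwise, period 2W."""
    return 1.0 if (t % (2.0 * width)) < width else -1.0

def s_curve(tau, width=1.0, steps=2000):
    """Correlate p(t - tau) with a W-second gate that opens at t = W/2."""
    dt = width / steps
    total = 0.0
    for n in range(steps):
        t = width / 2.0 + (n + 0.5) * dt   # midpoint rule over the gate
        total += p(t - tau, width) * dt
    return total / width
```

For offsets inside the range $|\tau| \le W/2$ the result follows the straight line $2\tau/W$ (for example, `s_curve(0.1)` is close to 0.2), and it saturates at 1 for `tau = W/2`, which is the linear region used in the next step of the analysis.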
Equation (9) therefore becomes

$$\frac{d\bar{\tau}(t)}{dt} = \frac{d\bar{\tau}_1(t)}{dt} - (e\beta KGP)\, E_\tau[R_{yp}(\tau)]. \qquad (14)$$

The above is the differential equation of the mean timing error
variable in the tracking loop. If $\tau(t)$ is confined to the linear range
of $R_{yp}(\tau)$ (i.e., if $\tau \approx 0$ and the loop is tracking well), then we can
approximate $R_{yp}(\tau) \approx 2\tau/W$ and $E_\tau[R_{yp}(\tau)] = E_\tau[2\tau/W] = 2\bar{\tau}/W$.
Equation (14) then becomes a linear differential equation in terms
of the mean-error process $\bar{\tau}(t)$. Furthermore, this linear equation
corresponds to that of the linear feedback system in Fig. 5. The latter
is often called the linear mean equivalent loop to Fig. 3, and is
useful for analysis or synthesis based upon the mean timing
error process. Note that in this equivalent system, the input delay
variation $\tau_1$ appears as the loop input, and the loop timing oscillator
becomes a feedback loop integrator whose output is the timing
process $\tau_2(t)$. The linear equivalent loop has a loop gain of $2eGK\beta P/W$
and a loop bandwidth¹ of

$$B_L = \frac{eGK\beta P}{2W}. \qquad (15)$$

Note that the loop bandwidth depends directly on the received optical
power $P$, which therefore appears as a parameter of the equivalent
system. The loop bandwidth must be sufficiently wide (i.e.,
there must be sufficient loop gain) to track the expected time variations
in $\tau_1$.
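
The behavior of the linear mean-equivalent loop can be sketched by integrating the linearized form of (14), $d\bar{\tau}/dt = d\bar{\tau}_1/dt - A\bar{\tau}$ with $A = 2eGK\beta P/W$. The following is a minimal sketch assuming a constant delay rate (a hypothetical Doppler-like ramp) and forward-Euler integration; the function name, step size, and numerical values are illustrative only.

```python
import math

def mean_error_response(loop_gain, delay_rate, tau0, t_end, dt=1e-4):
    """Euler-integrate d(tau)/dt = delay_rate - loop_gain * tau.

    loop_gain plays the role of 2 e G K beta P / W in the linearized (14);
    delay_rate is a constant d(tau_1)/dt chosen for illustration.
    """
    tau = tau0
    for _ in range(int(round(t_end / dt))):
        tau += dt * (delay_rate - loop_gain * tau)
    return tau
```

An initial offset decays roughly as $\exp(-At)$, and a constant delay rate leaves a steady mean error of `delay_rate / loop_gain`, which illustrates why the loop gain (and hence the received power $P$) must be large enough for the expected delay dynamics.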

Although mean-error performance in tracking the received delay 
variations can be determined from the linear mean system, the 
adverse effects due to the random nature of the optical field and 
circuit noise cannot be derived (note that the linear system is 
noiseless). In this case, the dynamical equation of the true system, 
(8), must be examined in detail for a complete statistical analysis. 
Unfortunately, the discrete nature of the timing error equation
indicates that the statistics of the solution $\tau(t)$ will be highly nonstationary
as the process evolves in time. An indication of the statistical
properties of $\tau(t)$ can be obtained by examining the steady-state
probability density of $\tau$. This latter density, $f(\tau)$, is known to
satisfy the Kolmogorov-Smoluchowski steady-state equation [4,
ch. 7]:

$$\sum_{j=1}^{\infty} \frac{(-1)^j}{j!}\, \frac{\partial^{\,j-1}}{(\partial\tau)^{j-1}} \big[K_j(\tau) f(\tau)\big] = 0, \qquad (16)$$

where

$$K_j(\tau) = \lim_{\Delta t \to 0} \frac{E[(\Delta\tau)^j \mid \tau]}{\Delta t} \qquad (17)$$

$$\Delta\tau \triangleq [\tau(t + \Delta t) - \tau(t)]$$

with suitable initial conditions and with the condition that $f(\tau)$
integrate to one. When the coefficients $K_j(\tau)$ exist, this equation
provides a relation that must be satisfied by the steady-state density
of the process $\tau(t)$. The equation is a partial differential equation
with variable coefficients and, in general, involves all orders of
derivatives. The principal usefulness of (16), however, occurs when
only the first few coefficients are nonzero. In particular, if $K_j(\tau) = 0$,
$j \ge 3$, the resulting equation is the steady-state Fokker-Planck
equation, and has been extensively studied [4, ch. 7]. A Fokker-Planck
equation implies a “continuous” process; i.e., processes that
do not change significantly over a short-time period, while the more
general equation of (16) would be associated with processes containing
statistical jumps.

The calculation of the sequence of coefficients $K_j(\tau)$ requires
determination of the moments of the error increment $\Delta\tau$ in (17).
Consider again the system of (8) when tracking an intensity pulse
having a constant-time shift $\tau_1(t) = \tau_0$. The timing error $\tau(t)$
therefore satisfies (8) with $d\tau_1/dt = 0$. The timing error variation
$\Delta\tau$ is then

$$\Delta\tau = -K \int_t^{t+\Delta t} e(\rho)\, d\rho. \qquad (18)$$

Fig. 4. Tracking error characteristic for pure sync.

Fig. 5. Linear equivalent edge-tracking loop.

¹ In a linear feedback system, if $H(s)$ is the transfer function from loop
input to feedback signal, then the loop bandwidth is defined by $B_L =
\int |H(j\omega)|^2\, d\omega/2\pi$. It essentially represents the bandwidth that the loop
exhibits to the input.

The coefficients in (17) can now be determined by using (6) in (18).
Unfortunately, an exact calculation of the coefficients is hampered
by the sampled-data (short-term integrated) nature of the loop.
(The integrator smooths the error signal, and produces in effect a
second-order loop.) We shall consider instead a continuous first-order
loop in which the integrator is neglected, and the resulting
time-averaged (over a period $W$) coefficients are used as an approximation
to the desired steady-state coefficients. This is equivalent to
assuming that the integration over the pulsewidth is extremely
short relative to the time variations of the input process, and can
be neglected. The subsequent time averaging of the coefficients is
similar to the smoothing produced by the loop. It is shown in Appendix
A that the coefficients computed in this way become

$$K_j(\tau) = \begin{cases}
-(GeK\beta P)\, R_{yp}(\tau), & j = 1 \\
(GeK)^2 \big[\beta R_{y2}(\tau) + N_0/2(Ge)^2\big], & j = 2 \\
(-GeK)^j\, \beta R_{yj}(\tau), & j \ge 3
\end{cases} \qquad (19)$$

where

$$R_{yj}(\tau) \triangleq \frac{1}{W}\int_{-\infty}^{\infty} y^j(t - \tau_2)\,[I(t - \tau_1) + k_n]\, dt. \qquad (20)$$

For the loop signal $y(t)$ in (11), the above can be further simplified
by noting that for all $j$

$$R_{yj}(\tau) = R_{y1}(\tau). \qquad (21)$$

Thus, the steady-state Kolmogorov equation becomes

$$0 = (eGK\beta P)\,[R_{yp}(\tau) f(\tau)] + \frac{(eGK)^2}{2}\frac{d}{d\tau}\Big\{\big[\beta R_{y1}(\tau) + N_0/2(Ge)^2\big] f(\tau)\Big\}
+ \sum_{j=3}^{\infty} \frac{(eGK)^j}{j!}\, \frac{d^{\,j-1}}{d\tau^{\,j-1}} \big\{\beta R_{y1}(\tau) f(\tau)\big\}. \qquad (22)$$

The infinite number of derivatives manifests the discontinuities of
the error process caused by the quantum nature of the detected
optical process. It is the form of this equation vis-a-vis the Fokker-Planck
equation that theoretically separates optical tracking from
tracking in additive Gaussian noise. This complication was previously
noted by Ohlson [9] when dealing with added shot noise.²
Although an exact solution for $f(\tau)$ is somewhat ambitious, some
meaningful information and approximating solutions can be derived.
In particular, consider the case where the system operates in near-lock
operation, so that it may be assumed that $\tau \approx 0$. The instantaneous
tracking error can therefore be considered to be confined to
the linear range of $R_{yp}(\tau)$ and

$$R_{yp}(\tau) \approx 2\tau/W$$

$$R_{y1}(\tau) \approx (P + k_n). \qquad (23)$$

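Substituting into (22) and dividing by the coefficient of the second
term yields the modified equation

$$0 = \alpha\tau f(\tau) + \frac{d}{d\tau} f(\tau) + \sum_{j=3}^{\infty} A_j\, \frac{d^{\,j-1}}{d\tau^{\,j-1}} f(\tau) \qquad (24)$$

where

$$\alpha = \frac{2\beta P}{(WeGK/2)\big[\beta(P + k_n) + N_0/2(Ge)^2\big]} \qquad (25)$$

$$A_j = \frac{2}{j!}\left(\frac{4}{W\alpha}\right)^{j-2} \left[\frac{\beta(P + k_n)\,(\beta P)^{j-2}}{\big[\beta(P + k_n) + N_0/2(Ge)^2\big]^{j-1}}\right]. \qquad (26)$$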

Note that the coefficients $A_j$ vary as $(1/\alpha)^{j-2}$, exhibiting a decreasing
importance of the higher derivatives as the parameter $\alpha$ in (25) is
increased. (The bracketed term in $A_j$ is bounded by one and approaches
one as the system approaches quantum-limited operation,
i.e., when $\beta P \gg \beta k_n + N_0/2(Ge)^2$.) A physical interpretation of $\alpha$
can be introduced by noting the linear mean-equivalent loop of
Fig. 5. Since it is often desirable to operate the tracking loop with a
given loop bandwidth $B_L$, the loop gain $K$ is generally adjusted so as
to obtain this value in (15). Thus, if $B_L$ is the desired bandwidth,
then we set $K = 2B_L W/Ge\beta P$, and (25) becomes

$$\alpha = \frac{2}{W^2}\left\{\frac{(\beta P)^2}{B_L\big[\beta(P + k_n) + N_0/2(Ge)^2\big]}\right\}. \qquad (27)$$


When written in this way, the numerator in the braces is the square
of the mean intensity of the received signal, while the denominator
is effectively the total noise power occurring in the bandwidth $B_L$
[since the level $\beta(P + k_n)$ is the two-sided shot-noise spectral
level]. Thus, $\alpha$ is proportional to a signal-to-noise power ratio, and
the coefficients $A_j$ in (26) vary as an inverse power of this ratio.
We therefore expect that solutions for $f(\tau)$ can be suitably approximated
by solving truncated versions of (24) with fewer and fewer
terms, as we increase the optical signal-to-noise ratio in (27).

² It should be noted that [9] considered the case of phase tracking a
desired sine wave with additive shot noise. The optical model here involves
an input shot-noise process with the signal imbedded within. Both
lead to infinite-order equations but with slightly different coefficients.
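
The step from (25) to (27) is a direct substitution of $K = 2B_LW/Ge\beta P$, and it can be checked numerically. The sketch below is an illustration with arbitrary normalized parameter values; the function names are hypothetical and the numbers are not taken from the paper.

```python
def noise_level(beta, power, k_n, n0, gain, charge):
    """beta*(P + k_n) + N_0 / (2 (G e)^2), the bracketed term in (25) and (27)."""
    return beta * (power + k_n) + n0 / (2.0 * (gain * charge) ** 2)

def alpha_from_gain(beta, power, k_n, n0, gain, charge, loop_k, width):
    """alpha of (25), written in terms of the loop gain constant K."""
    noise = noise_level(beta, power, k_n, n0, gain, charge)
    return 2.0 * beta * power / ((width * charge * gain * loop_k / 2.0) * noise)

def alpha_from_bandwidth(beta, power, k_n, n0, gain, charge, loop_bw, width):
    """alpha of (27), written in terms of the loop bandwidth B_L."""
    noise = noise_level(beta, power, k_n, n0, gain, charge)
    return (2.0 / width ** 2) * (beta * power) ** 2 / (loop_bw * noise)

def gain_for_bandwidth(beta, power, gain, charge, loop_bw, width):
    """K = 2 B_L W / (G e beta P), the setting that yields B_L in (15)."""
    return 2.0 * loop_bw * width / (gain * charge * beta * power)
```

Choosing `loop_k = gain_for_bandwidth(...)` and evaluating both expressions with the same remaining arguments should give the same $\alpha$ to within rounding, and widening `loop_bw` lowers $\alpha$, consistent with its reading as a signal-to-noise ratio in the loop bandwidth.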

Further properties of the solution for $f(\tau)$ for the in-lock operation
case can also be derived. Transforming both sides of (24) indicates
that the characteristic function of $f(\tau)$, $\varphi(\omega)$, satisfies the differential
equation

$$j\alpha\,\frac{d\varphi(\omega)}{d\omega} + j\omega\,\varphi(\omega) + \sum_{i=2}^{\infty} A_{i+1}(j\omega)^i \varphi(\omega) = 0. \qquad (28)$$

This means

$$\ln \varphi(\omega) = \frac{(j\omega)^2}{2\alpha} + \frac{1}{\alpha}\sum_{i=2}^{\infty} \frac{A_{i+1}(j\omega)^{i+1}}{i+1} + C \qquad (29)$$

where $C$ is chosen to satisfy the unit area constraint on $f(\tau)$. Since
the right-hand side is in the form of a power series in $j\omega$, the
semi-invariants of the steady-state density can immediately be identified.
Note that the first semi-invariant (mean value) is zero, the second
semi-invariant (variance) is $1/\alpha$, and the higher semi-invariants
are related to the $\{A_i\}$. Thus, the actual form of the solution density
depends on these coefficients. As a limiting case, however, we see
that if $\alpha \to \infty$, implying $A_i/\alpha \to 0$ for all $i \ge 3$, then $\ln \varphi(\omega) =
-\omega^2/2\alpha$, corresponding to a zero-mean Gaussian density for $\tau$,
having variance $1/\alpha$. On the other hand, for $\alpha < \infty$ the higher
coefficients can no longer be neglected, and the complete series in
(29) must be included. Of course, as $\alpha$ is decreased in value, the
increasing variance will cause the loop error to exceed the linear
range of $R_{yp}(\tau)$, violating our assumption that the loop is in fact
completely linear. Nonetheless it is important to recognize that even
though the loop error density varies in form from the asymptotic
Gaussian density for $\alpha \to \infty$ to the more complicated density defined
by (29) as $\alpha \to 0$, the variance of the density for the linear system is
always $1/\alpha$ [i.e., the second semi-invariant in (29)]. Assuming
shot-noise limited operation [$\beta(P + k_n) \gg N_0/2(Ge)^2$], the loop error
variance for the normalized delay variable $x = \tau/W$ is therefore

$$\sigma_x^2 = \frac{1}{\alpha W^2} = \frac{1}{\Gamma}\left[1 + \frac{2(\beta k_n/B_L)}{\Gamma}\right] \qquad (30)$$

where

$$\Gamma = 2\beta P/B_L. \qquad (31)$$

The above is plotted in Fig. 6 as a function of the parameter $\Gamma$ for
several values of normalized noise energy $\beta k_n/B_L$. The curves, in
essence, summarize the performance of a pure sync system operating
with optical power $P$ watts in a tracking bandwidth of $B_L$ Hz. The
rapid increase in the normalized error variance as the parameter $\Gamma$
is decreased represents the deterioration of the timing performance.
The presence of background noise ($k_n$) causes the increase to occur
at higher values of $\Gamma$. Since $2\beta P$ represents the rate of occurrence of
signal photoelectrons during pulse transmission, the parameter $\Gamma$ in
(31) can be considered as an indication of the “denseness” of signal
counts, indicating the accumulation of electrons over a time period
equal to the reciprocal of the loop bandwidth. Alternatively, by
substituting for $\beta$, we can write $\Gamma = \eta A 2P/hfB_L$, which now has
the familiar interpretation as the ratio of total received pulse power
to the quantum noise in the loop bandwidth $B_L$. The ratio $\beta k_n/B_L$
has a similar interpretation in terms of received background noise
power.
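
The dependence of the normalized timing variance on $\Gamma$ can be tabulated with a few lines of code. This sketch assumes shot-noise limited operation ($N_0$ neglected), in which case the variance $1/\alpha$ with $\alpha$ from (27) gives a variance of $\tau/W$ equal to $1/(\alpha W^2) = (1/\Gamma)[1 + 2(\beta k_n/B_L)/\Gamma]$. The function names and parameter values are hypothetical.

```python
def normalized_variance(gamma, noise_energy=0.0):
    """Variance of tau/W: (1/gamma) * (1 + 2*noise_energy/gamma).

    gamma        -- 2*beta*P/B_L, signal counts per reciprocal loop bandwidth
    noise_energy -- beta*k_n/B_L, background counts on the same time scale
    """
    if gamma <= 0.0:
        raise ValueError("gamma must be positive")
    return (1.0 + 2.0 * noise_energy / gamma) / gamma

def variance_from_alpha(beta_p, beta_kn, loop_bw, width):
    """Same quantity computed as 1/(alpha*W^2) with alpha from (27), N_0 = 0."""
    alpha = (2.0 / width ** 2) * beta_p ** 2 / (loop_bw * (beta_p + beta_kn))
    return 1.0 / (alpha * width ** 2)
```

Evaluating `normalized_variance` over a range of `gamma` for a few values of `noise_energy` reproduces the qualitative behavior described for Fig. 6: the variance falls as $1/\Gamma$ at high $\Gamma$, and background noise raises it most noticeably at low $\Gamma$.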

It should be pointed out that the variances plotted in Fig. 6 
correspond to the relatively simple tracking loop in Fig. 3. Some 
improvement in performance can often be attained by designing 
more complicated tracking systems. For example, by simultaneously 
processing with a parallel integrator over the leading edge of the 
subsequent sync pulse [i.e., over the time interval (3W/2, 5W/2)]
one can integrate and negatively combine with e(t) in Fig. 3 to 
strengthen (double) the error amplitude. This effectively doubles 
the signal power and theoretically produces a 3-dB power savings. 


Fig. 6. Variance of normalized error $\tau/W$ versus loop signal-to-noise ratio $\Gamma$, pure sync ($\Gamma = 2\beta P/B_L$).


The interested reader is referred to [6] for further discussion of these 
possible modifications. 


IV. MODULATION-DERIVED EDGE TRACKING 
WITH PPM 


In a PPM system the received optical intensity is no longer
periodic, but varies in position according to the data bit sequence.
For example, in binary PPM, if the optical intensity is written as
in (1), then its modulation during a bit period is given by $p(t)$ in
(2) if a binary one is sent, but is given by $-p(t)$ if a binary zero is
sent, as is obvious from Fig. 2(a). A receiver attempting to attain
time synchronization by edge tracking the center transitions during
each bit period will be adversely affected by the data modulation.
If a datum one is sent, a timing error $\tau$ will generate a loop error
voltage of $R_{yp}(\tau)$, as discussed in Section III. However, if a datum
zero is sent, an error voltage of $-R_{yp}(\tau)$ is generated in the same
loop. Hence, for equally likely data bits, the average error voltage
within the loop is then [$R_{yp}(\tau)$ (probability of one being sent) $-$
$R_{yp}(\tau)$ (probability of zero being sent)] $= 0$. That is, on the average,
no loop error is generated for controlling the receiver timing oscillator
during modulation reception.
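
The cancellation argument can be illustrated with a simple average over random data. This is a minimal sketch assuming independent, equally likely bits and a fixed timing error; `s_value` stands in for $R_{yp}(\tau)$ at that error, and the function name is hypothetical.

```python
import random

def average_raw_error(s_value, trials=100000, seed=5):
    """Average the loop error seen by an unaided edge tracker under random PPM data.

    A one contributes +s_value and a zero contributes -s_value.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        bit = rng.randint(0, 1)
        total += s_value if bit == 1 else -s_value
    return total / trials
```

For any fixed `s_value` the average stays near zero, so the unaided loop receives no usable mean correction signal while data modulation is present.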

To compensate for this modulation, an augmented edge-tracking
system can be used, as shown in Fig. 7.³ The decision loop attempts
to determine the true data bit, using this decision to properly modify
the sign of the loop error voltage. This can be implemented by multiplying
the generated error in a delayed (by one-bit period) tracking
loop by a plus or minus one, depending on the data bit. The latter
decision is made from a count comparison over each possible bit
subinterval as they arrive. Thus, the error in the delayed edge-tracking
loop becomes $b\,e(t)$, where

$$b = \begin{cases} +1, & \text{if one is sent} \\ -1, & \text{if zero is sent} \end{cases} \qquad (32)$$

or equivalently,

$$b = \begin{cases} +1, & \text{if } k_1 > k_2 \\ -1, & \text{if } k_1 < k_2 \end{cases} \qquad (33)$$

where $k_1$, $k_2$ are the counts over the first and second subinterval of
each bit period. The differential equation in (8) for the tracking
loop error now becomes

$$\frac{d\tau}{dt} = \frac{d\tau_1}{dt} - K[b\,e(t)]. \qquad (34)$$


³ Again we point out that even in the modulation case more complicated
tracking loops can be derived [5] which achieve performance
improvement by differencing adjacent bit intervals.
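The sign decision of (33) and its use on the delayed loop error can be sketched as follows. This is a minimal illustration under stated assumptions: the subinterval counts are modeled as independent Poisson variables, a one places the pulse in the first subinterval, ties return 0 (a case (33) does not cover), and the names `poisson`, `decide_sign`, `corrected_error` and `simulate` are hypothetical.

```python
import math
import random


def poisson(mean, rng):
    """Draw one Poisson sample (Knuth's multiplication method; fine for small means)."""
    limit = math.exp(-mean)
    count, product = 0, rng.random()
    while product > limit:
        count += 1
        product *= rng.random()
    return count


def decide_sign(k1, k2):
    """Sign b of (33): +1 if the first-subinterval count is larger, -1 if smaller.
    A tie is not covered by (33); returning 0 (no correction) is an assumption."""
    if k1 > k2:
        return 1
    if k1 < k2:
        return -1
    return 0


def corrected_error(raw_error, k1, k2):
    """Delayed loop error b*e(t): the raw error multiplied by the decided sign."""
    return decide_sign(k1, k2) * raw_error


def simulate(bits, signal_count, noise_count, seed=1):
    """Fraction of bits whose decided sign matches the transmitted bit."""
    rng = random.Random(seed)
    correct = 0
    for bit in bits:
        mean1 = noise_count + (signal_count if bit == 1 else 0.0)
        mean2 = noise_count + (signal_count if bit == 0 else 0.0)
        b = decide_sign(poisson(mean1, rng), poisson(mean2, rng))
        correct += (b == (1 if bit == 1 else -1))
    return correct / len(bits)
```

Calling `simulate([1, 0] * 500, signal_count=20.0, noise_count=1.0)` gives a rough feel for how often the decision, and hence the sign of the loop error, is correct at a given pulse energy.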


Since the counts in (33) are random counts, the parameter b is a
random variable. Thus, the coefficients K_j(τ) in (17) for the steady-state
density will be a function of this variable, and therefore require
a subsequent average over its statistics. When a one is sent the
probability that k_1 > k_2 is equivalent to the probability that the
one is correctly detected, whereas the probability that k_1 < k_2
corresponds to the probability that an error is made. Hence, when
a one is sent,

\[ b = \begin{cases} +1 & \text{with probability } 1 - PE_1 \\ -1 & \text{with probability } PE_1 \end{cases} \tag{35} \]

where PE_1 is the bit error probability when a one is sent. When a
zero is sent, the above signs are reversed and PE_1 is replaced by
PE_0. It should be remembered, however, that the timing for this
subinterval counting is in turn controlled by the receiver loop timing
signal, which will have loop timing errors incorporated within it.
Thus, the bit-error probabilities in (35) must include these timing
error effects. (A timing error between the true bit-arrival time and
the start of subinterval counting will cause the counting to occur
over an offset interval.) The effect of these timing errors on bit
decisioning has been previously derived [2], and a typical average
bit error probability plot of PE = ½[PE_1 + PE_0] is shown in Fig. 8,
as a function of the timing error τ, optical pulse energy S = 2βPW,
and noise rate βk_n. This timing error τ is in fact a function of time,
but can be considered a constant over several bit periods.

The steady-state coefficients K_j(τ) can be evaluated from (17),
(19), and (34) by first conditioning on b and then averaging over
the probabilities in (35). Using primes to denote the K_j coefficients
when data modulation is present, and noting that b^j = 1 for all j
even, we see that

\[ K_j'(\tau) = \begin{cases} K_j(\tau), & j \text{ even} \\ \tfrac{1}{2} K_j(\tau)\{(+1)[1 - PE_1] + (-1)PE_1 - (-1)[1 - PE_0] - (+1)PE_0\}, & j \text{ odd} \end{cases} \]

\[ \phantom{K_j'(\tau)} = \begin{cases} K_j(\tau), & j \text{ even} \\ K_j(\tau)[1 - 2PE(\tau)], & j \text{ odd} \end{cases} \tag{36} \]


where PE(τ) = ½[PE_1 + PE_0]. Note that the dependence of PE
on τ has been emphasized. The resulting steady-state density equation
is again given by (16) with K_j(τ) replaced by the K_j'(τ) above.
Note that the coefficients are now more complicated functions of τ
due to the auxiliary decisioning, and approach the earlier results as
PE(τ) → 0. In this latter case, the system is correctly identifying
the true bit during each period, and essentially "removing" the
binary modulation. The first coefficient,


Fig. 7. Modified edge-tracking loop for PPM sync.

\[ K_1'(\tau) = GeK\beta P\,R_{yp}(\tau)\,[1 - 2PE(\tau)] \]

is the average loop error function, and represents the modified nonlinearity
of the mean equivalent loop, as in (12). This coefficient is
plotted in Fig. 9 as a function of the normalized τ and received pulse
energy S, obtained by use of Figs. 4 and 8. Note that the effect of
the decision process is to reduce the width and amplitude of the
tracking error function. As PE(τ) → 1/2, the factor [1 − 2PE(τ)]
vanishes, so there is no average error being generated for loop
tracking, and the system essentially loses lock.
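The averaging that leads to (36), and the loss of lock as PE approaches 1/2, can be verified with a short calculation. This is a sketch only; the function names are hypothetical and the input error probabilities are arbitrary test values, not data from Fig. 8.

```python
def sign_factor(pe_one, pe_zero):
    """Average sign applied to the odd-order coefficients for equally likely
    bits: the bracketed expression of (36) multiplied by 1/2."""
    one_term = (+1) * (1.0 - pe_one) + (-1) * pe_one      # one sent: b = +1 when correct
    zero_term = (-1) * (1.0 - pe_zero) + (+1) * pe_zero   # zero sent: b = -1 when correct
    return 0.5 * (one_term - zero_term)


def modified_coefficient(k_j, j, pe_one, pe_zero):
    """K_j' of (36): unchanged for even j, scaled by 1 - 2*PE for odd j."""
    if j % 2 == 0:
        return k_j
    return k_j * sign_factor(pe_one, pe_zero)


if __name__ == "__main__":
    print(sign_factor(0.1, 0.3))   # equals 1 - 2*PE with PE = 0.2
    print(sign_factor(0.5, 0.5))   # no average error: the loop loses lock
```

The factor depends only on the average PE = ½[PE_1 + PE_0], so unequal error probabilities for ones and zeros do not change the form of the result.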


For the near-lock assumption [use of (23)] the previous steady-state
equation is modified to

\[ 0 = \alpha\tau\,[1 - 2PE(\tau)]\,f(\tau) + \frac{df(\tau)}{d\tau} + \sum_{j=3}^{\infty} A_j \frac{d^{\,j-1} f(\tau)}{d\tau^{\,j-1}}. \tag{37} \]

Even with this simplification, neither the solution density nor its
characteristic function can be generated as easily as in Section III,
since the first coefficient is now more complicated. However, the
fractional variance for this density can be estimated by approximating
the coefficient K_1'(τ) in Fig. 9 by a sinusoid of proper amplitude
and frequency. This latter amplitude will depend on the energy S
per data bit used for decisioning, which in turn is related to the Γ
parameter in (31) by

\[ S = (B_L W)\,\Gamma \tag{38} \]

where B_L W = B_L/2R_b = ½(B_L/R_b). The parameter (B_L/R_b) is the
ratio of tracking loop bandwidth to the data bit rate R_b, and is
typically less than one. When written as in (38), B_L W can also be
interpreted as the fraction of the sync energy Γ appearing in the
data pulse and therefore used in the auxiliary decisioning. For a
fixed value of B_L W, each value of Γ generates a corresponding value
of S, to which an effective one-cycle sine wave can be fitted to the
corresponding curves of K_1'(τ) as in Fig. 9. The variance can then
be determined at each Γ by numerically solving a truncated version
of (37), using the method discussed in Appendix B. The resulting
normalized variance computed in this way is plotted in Fig. 10, as a
function of Γ for several values of B_L W. The curve for the noiseless,
pure sync operation from Fig. 6 is superimposed. The results show
that a deterioration of performance occurs over pure sync, due to the
decisioning process, and can therefore be considered as the price to
be paid for modulation-derived synchronization. Note that the
decisioning causes system degradation similar to an effective loss
in signal-to-noise ratio (reduced Γ) and can therefore be interpreted
as a power loss in the sync subsystem.
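Relation (38) is simple enough to evaluate directly. The sketch below assumes the binary PPM pulse width W = 1/(2R_b) implied by B_L W = B_L/2R_b; the function names and the numerical values in the example are illustrative assumptions, not figures from the paper.

```python
def bandwidth_pulsewidth_product(loop_bandwidth_hz, bit_rate_hz):
    """B_L * W with W = 1/(2 R_b), i.e. half the ratio B_L / R_b."""
    return loop_bandwidth_hz / (2.0 * bit_rate_hz)


def pulse_energy(gamma, loop_bandwidth_hz, bit_rate_hz):
    """Decision energy S = (B_L W) * Gamma of (38)."""
    return bandwidth_pulsewidth_product(loop_bandwidth_hz, bit_rate_hz) * gamma


if __name__ == "__main__":
    # Assumed example: 1 kHz loop bandwidth, 1 Mb/s data, loop SNR of 1e5.
    print(bandwidth_pulsewidth_product(1e3, 1e6))
    print(pulse_energy(1e5, 1e3, 1e6))
```

Because B_L/R_b is typically much less than one, only a small fraction of the loop signal-to-noise ratio Γ is available as decision energy in each data pulse.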

Although the curves in Fig. 10 are convenient for assessing
performance, their derivation requires a somewhat lengthy
calculation. Furthermore, this computation must be repeated at
each desired value of background noise k_n. However, a simpler method
can be used, at the expense of analytical accuracy, to derive similar
curves. This method makes use of a form of truncated quasi-linear
solution, which basically amounts to reducing (37) to a first-order
equation, and replacing the first coefficient by a modified linear
coefficient as in (24), but retaining its dependence on the decision
error probability PE. To accomplish this linearization we first


Fig. 10. Variance of normalized error versus loop signal-to-noise ratio Γ (PPM sync).


recognize that PE depends on both pulse energy S and timing error
τ, and we write this as PE(S, τ). To linearize, we replace the functional
dependence on τ by the rms value of τ; i.e., τ_rms ≈ (1/α)^{1/2}. Thus
we consider instead PE[S, (1/α)^{1/2}]. The quasi-linear differential
equation for the timing error density f(τ) is then taken as

\[ \frac{df(\tau)}{d\tau} + \alpha\,\{1 - 2PE[S, (1/\alpha)^{1/2}]\}\,\tau f(\tau) = 0. \tag{39} \]


Note that the equation is linear in f(τ), but the coefficients are
nonlinear in α. The solution for f(τ) in (39) yields a Gaussian density
with normalized variance

\[ \frac{1}{W^2\alpha\,\{1 - 2PE[S, (1/\alpha)^{1/2}]\}}. \tag{40} \]


For the shot-noise limited case, 1/W²α is given in (30), S is related
to Γ by (38), and the variance depends only upon the parameters
Γ, B_L W, and k_n/B_L. The values of PE at any value of S
and τ_rms can be obtained from curves similar to Fig. 8. Several
points of the above variance for the shot-noise limited case are
superimposed in Fig. 10. The results tend to display the same behavior
for tracking performance, although the variance values indicate
slightly lower variances than the more accurate results determined
earlier.
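The quasi-linear estimate of (40) can be sketched as below. The bit error probability must come from curves such as Fig. 8, which are not reproduced here, so `placeholder_pe` is a purely hypothetical stand-in and not the paper's data; the argument `w2_alpha` denotes the dimensionless product W²α, so the rms error passed to the PE function is normalized to W.

```python
import math


def placeholder_pe(pulse_energy, rms_error):
    """Hypothetical stand-in for the Fig. 8 curves (NOT the paper's data):
    error probability falls with energy and rises with timing error."""
    return 0.5 * math.exp(-pulse_energy * max(0.0, 1.0 - 2.0 * rms_error))


def quasi_linear_variance(w2_alpha, pulse_energy, pe_func=placeholder_pe):
    """Normalized variance of (40): 1 / (W^2 alpha {1 - 2 PE[S, rms]}),
    with the rms error taken as (1 / (W^2 alpha))**0.5 in units of W."""
    rms_error = (1.0 / w2_alpha) ** 0.5
    factor = 1.0 - 2.0 * pe_func(pulse_energy, rms_error)
    if factor <= 0.0:
        raise ValueError("PE >= 1/2: no average error signal, loop out of lock")
    return 1.0 / (w2_alpha * factor)


if __name__ == "__main__":
    print(quasi_linear_variance(100.0, 5.0))                     # with decisioning
    print(quasi_linear_variance(100.0, 5.0, lambda s, r: 0.0))   # pure-sync limit
```

Setting PE to zero recovers the pure-sync value 1/W²α, and any nonzero PE inflates the variance, which is the power loss attributed to decisioning in the text.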


APPENDIX A 

The computation of the coefficients in (17) is hampered by the
sampled-data nature of the tracking loop in Fig. 3. We consider
instead a continuous first-order loop in which the sampler is neglected,
and its time-averaged coefficients will be used as an approximation
to the steady-state K_j. This is equivalent to assuming that
the short-term integration over the pulsewidth is extremely narrow
relative to the time variations of the input process, and can be
neglected. The error signal e(t) is therefore taken as the signal prior
to the integrator, or

\[ e(t) = [\,i(t) + n_c(t) - Ge\beta(P + k_n)\,]\,y(t - \tau_1) \]

where y(t) is defined in (11). Substituting for i(t) from (3), and
evaluating (18) yields

\[ (-\Delta\tau) = C\sum_{m=1}^{N} y(t_m - \tau_1) + K\int_t^{t+\Delta t} n_c(z)\,dz - C\beta(P + k_n)\,\Delta t \tag{A1} \]

where C = GeK and N = N(t, t + Δt), the latter defined in (6). To
determine the K_j coefficients, the time-averaged moments of Δτ
must be calculated. The first two moments are as follows. Denoting
statistical averages with overbars, we have

\[ \overline{[-\Delta\tau]} = C\,\overline{\sum_m y(t_m - \tau_1)} + K\int_t^{t+\Delta t} \overline{n_c(z)}\,dz - C\beta(P + k_n)\,\Delta t. \tag{A2} \]

Now \overline{n_c(z)} = 0 and it is known that Poisson shot noise has mean
[11, p. 1619]

\[ \overline{\sum_m y(t_m - \tau_1)} = \beta\int_t^{t+\Delta t} y(t_m - \tau_1)\,[I(t_m - \tau_2) + k_n]\,dt_m. \tag{A3} \]

In the limit as Δt → 0

\[ \lim_{\Delta t \to 0} \int_t^{t+\Delta t} y(a - \tau_1)\,[I(a - \tau_2) + k_n]\,da = (\Delta t)\,[\,y(t - \tau_1)\,I(t - \tau_2) + k_n\,]. \tag{A4} \]

The above term is a function of t. The first coefficient is derived as
the time-average value over an integration period W, assuming that
τ does not change during this interval. Hence,

\[ K_1 = -C\,\frac{\beta}{W}\int_{W} y(t - \tau_1)\,I(t - \tau_2)\,dt + C\beta P = -C\,\frac{\beta}{W}\int_{W} y(t - \tau_1)\,[I(t - \tau_2) - P]\,dt. \tag{A5} \]

The calculation of the mean-squared value of (A1) requires computation
of the cross products involved. However, noting that the
eventual computation of the K_j requires a division by Δt followed
by a limit as Δt → 0, only the terms of order Δt need be retained. In
particular, we see from (A4) that any product of averages of the
shot-noise summation will always be at least of order Δt². Hence,
we have

\[ \overline{(\Delta\tau)^2} = C^2\,\overline{\Big[\sum_m y(t_m - \tau_1)\Big]^2} + K^2 \int_t^{t+\Delta t}\!\!\int_t^{t+\Delta t} \overline{n_c(z_1)\,n_c(z_2)}\,dz_1\,dz_2 + O(\Delta t)^2. \tag{A6} \]

The average of the first summation is known to be [11, p. 1619]

\[ C^2\beta \int_t^{t+\Delta t} y^2(t_m - \tau_1)\,[I(t_m - \tau_2) + k_n]\,dt_m + O(\Delta t^2). \tag{A7} \]

Since the circuit noise is white with spectral level N_0, the second
integral in (A6) is known to be K²N_0Δt/2 [12, p. 86]. Proceeding as
in (A5), we have

\[ K_2 = C^2\,\frac{\beta}{W}\int_{W} y^2(t - \tau_1)\,[I(t - \tau_2) + k_n]\,dt + K^2 N_0/2. \tag{A8} \]

Similarly, we have

\[ \overline{(\Delta\tau)^j} = (-C)^j\,\overline{\Big[\sum_m y(t_m - \tau_1)\Big]^j} + O(\Delta t)^2, \quad j \ge 3. \tag{A9} \]

By manipulating as above we finally derive

\[ K_j = (-C)^j\,\frac{\beta}{W}\int_{W} y^j(t - \tau_1)\,[I(t - \tau_2) + k_n]\,dt. \tag{A10} \]

Equations (A5), (A8), and (A10) are summarized in (19) of the
paper. We point out that the first coefficient above is the same as
would be calculated for the sampled system, but the higher coefficients
would become considerably more difficult to determine.
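The shot-noise averages used in (A3) and (A7) are instances of Campbell's theorem: for a Poisson process of rate λ(t), the sum Σ y(t_m) has mean ∫ y λ dt and variance ∫ y² λ dt. The Monte Carlo sketch below checks this for an assumed homogeneous rate and an assumed weighting y(t) = t; neither choice comes from the paper, and all names are hypothetical.

```python
import random


def weighted_shot_sum(rate, duration, weight, rng):
    """Sum of weight(t_m) over the events t_m of a homogeneous Poisson
    process of the given rate on [0, duration)."""
    total, t = 0.0, rng.expovariate(rate)
    while t < duration:
        total += weight(t)
        t += rng.expovariate(rate)
    return total


def sample_moments(rate, duration, weight, trials=20000, seed=7):
    """Monte Carlo mean and variance of the weighted shot-noise sum."""
    rng = random.Random(seed)
    values = [weighted_shot_sum(rate, duration, weight, rng) for _ in range(trials)]
    mean = sum(values) / trials
    var = sum((v - mean) ** 2 for v in values) / (trials - 1)
    return mean, var


if __name__ == "__main__":
    # Campbell's theorem predicts mean = 5 * 1/2 and variance = 5 * 1/3 here.
    print(sample_moments(5.0, 1.0, lambda t: t))
```

The mean-square value equals the variance plus the squared mean; over a short interval Δt the squared mean is of order Δt², which is why only the variance term survives in (A7).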


APPENDIX B 

The procedure here follows that of Ohlson [9]. Consider the
equation

\[ 0 = [\alpha Q \sin 2\pi\tau]\,f(\tau) + \frac{df}{d\tau} + \sum_{j=3}^{N} A_j \frac{d^{\,j-1} f(\tau)}{d\tau^{\,j-1}}, \quad |\tau| < W/2 \tag{B1} \]

which is a truncated version of (37) with K_1'(τ) = Q sin 2πτ. We
assume an even solution having the form

\[ f(\tau) = \frac{1}{W}\Big[C_0 + 2\sum_{k=1}^{\infty} C_k \cos\Big(\frac{2\pi k}{W}\tau\Big)\Big], \quad |\tau| < W/2 \tag{B2} \]

where

\[ C_k = \int_{-W/2}^{W/2} f(\tau)\cos\Big(\frac{2\pi k}{W}\tau\Big)\,d\tau. \tag{B3} \]

If we substitute (B2) into (B1), collect harmonic terms, and set the
resulting coefficients equal to zero, we derive the following
second-order recursive equations among the C_k:




Ct. i + 


[ 


2 fc * 2A,k’ ~\ 

Qa Qa J 


Ct. 


(odd) 


( B4) 


The above allows a generation of all subsequent C_k from the first
two, C_0 and C_1. These latter two are found from the conditions that
1) f(τ) be a probability density over (−W/2, W/2) and 2) for
large α, f(τ) must approach the known solution corresponding to
A_j = 0 in (40). From (B3) we see that the first condition requires
that C_0 = 1, while for A_j = 0, (B4) becomes

\[ C_{k+1} = C_{k-1} - \frac{2k}{Q\alpha}\,C_k, \quad C_0 = 1. \tag{B5} \]

The solution is then

\[ C_k = \frac{I_k(Q\alpha)}{I_0(Q\alpha)} \tag{B6} \]

where I_k is the imaginary Bessel function of order k. Thus, C_1 was
selected as I_1(Qα)/I_0(Qα) in subsequent analysis using (B4). The
density f(τ) can now theoretically be constructed by solving (B4)
and using the C_k in (B2).

We are primarily interested in the normalized variance of this
tracking error, given by

\[ \sigma^2 = \int \tau^2 f(\tau)\,d\tau = \frac{1}{12} + 2\sum_{k \ge 1} C_k \int_{-1/2}^{1/2} x^2 \cos(2\pi k x)\,dx. \tag{B7} \]




The above can be computed directly from the C_k generated from
(B4). To examine the truncation error, (B4) was numerically solved
for Q in the range (0.1-0.5) and N = 3 and 5. For the range of
interest (1 < α < 100) no noticeable change in variance appeared
for N greater than 3. Hence, the truncation was limited to N = 3
in all subsequent results. With N = 3, the variance was then computed
from (B7) as a function of α, after generation of the C_k from
(B4). The value of Q, which itself depends on α, was determined for
each α from the curves of Fig. 9. The resulting variance is plotted
in Fig. 10 of the paper, assuming shot-noise limited operation.
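For the special case A_j = 0, the coefficients of (B6) and the variance of (B7) can be evaluated in a few lines. The sketch below covers only that case: it does not implement the truncated recursion (B4), and the argument value used in the example is an arbitrary assumption. The integral in (B7) is evaluated in closed form as (−1)^k/(2π²k²), and `bessel_i` is a hypothetical helper that sums the power series of the modified Bessel function.

```python
import math


def bessel_i(order, x, terms=200):
    """Modified Bessel function I_order(x) from its power series."""
    term = (x / 2.0) ** order / math.factorial(order)
    total = term
    for m in range(1, terms):
        term *= (x / 2.0) ** 2 / (m * (m + order))
        total += term
    return total


def fourier_coefficients(q_alpha, count):
    """C_k = I_k(Q alpha) / I_0(Q alpha) for k = 0 .. count-1, as in (B6)."""
    i0 = bessel_i(0, q_alpha)
    return [bessel_i(k, q_alpha) / i0 for k in range(count)]


def normalized_variance(coeffs):
    """Variance of (B7), using the closed form
    integral_{-1/2}^{1/2} x^2 cos(2 pi k x) dx = (-1)^k / (2 pi^2 k^2)."""
    total = 1.0 / 12.0
    for k in range(1, len(coeffs)):
        total += 2.0 * coeffs[k] * (-1) ** k / (2.0 * math.pi ** 2 * k ** 2)
    return total


if __name__ == "__main__":
    c = fourier_coefficients(4.0, 12)    # assumed example value of Q*alpha
    print(c[:4])
    print(normalized_variance(c))        # below the uniform-density value 1/12
```

When Qα = 0 every C_k with k ≥ 1 vanishes and the variance is 1/12, the value for a timing error uniformly distributed over one pulse width; increasing Qα concentrates the density and reduces the variance.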

REFERENCES 

[1] S. Karp and R. M. Gagliardi, "The design of a pulse-position
modulated optical communication system," IEEE Trans. Commun.
Technol., vol. COM-17, pp. 670-676, Dec. 1969.

[2] R. M. Gagliardi, "The effect of timing errors in optical digital
systems," IEEE Trans. Commun., vol. COM-20, pp. 87-93, Apr.
1972.

[3] J. Stiffler, Theory of Synchronous Communication. Englewood
Cliffs, N. J.: Prentice-Hall, 1971, pt. 2.

[4] W. C. Lindsey, Synchronization Systems in Communications and
Control. Englewood Cliffs, N. J.: Prentice-Hall, 1971.

[5] W. Lindsey and M. Simon, Telecommunication Systems Engineering.
Englewood Cliffs, N. J.: Prentice-Hall, 1973, ch. 9.

[6] R. M. Gagliardi and S. Karp, "M-ary Poisson detection and optical
communications," IEEE Trans. Commun. Technol., vol. COM-17,
pp. 208-216, Apr. 1969.

[7] J. R. Clark, "Estimation for Poisson processes with applications in
optical communications," Ph.D. dissertation, Mass. Inst. Tech.,
Cambridge, Mass., Sept. 1971.

[8] D. L. Snyder and I. B. Rhodes, "Phase and frequency tracking
accuracy in direct-detection optical communication systems,"
IEEE Trans. Commun., vol. COM-20, pp. 1139-1142, Dec. 1972.

[9] J. E. Ohlson, "Phase-locked loop operation in the presence of
impulsive and Gaussian noise," IEEE Trans. Commun., vol. COM-21,
pp. 991-996, Sept. 1973.

[10] W. C. Lindsey and M. K. Simon, "Data-aided carrier tracking
loops," IEEE Trans. Commun. Technol., vol. COM-19, pp. 157-168,
Apr. 1971.

[11] S. Karp, E. L. O'Neill, and R. M. Gagliardi, "Communication
theory for the free-space optical channel," Proc. IEEE, vol. 58,
pp. 1611-1626, Oct. 1970.

[12] A. Viterbi, Principles of Coherent Communications. New York:
McGraw-Hill, 1966.


